gatccgacca caccagcccg gcaccccgcg gggatactcg ccccgtccgc cgtccggaga 60
tcgctgcccc gcgccaccgc ctggccggca cgctgctgcc gctacgccac cagggccgcc 120 
gcgcctgcct tcagctccac tgcgtccatt gccggacccg gcttggccac gccagccgga 180 
ggccccgcca ccgagcacct gggccgaccc cgccctggcg ccgatacgca gtcggacgcg 240 
acccggcgag cgtggttggc gacgcatggt gcggctggtc acctttggcc ttgtcggcct 300
gggccggtcg ggcatgcagc gccaggaggc ccaattcgaa gcaacgatac gaaccgtcct 360
gcatggcaac cacaaggtcg ccgtgctggg caaaggaggt gtgggaaaga cgtcggttgc 420 
ggcgtgcgtc ggatcgatcc ttgccgaact gcgccagcag gaccgtatcg tcgggatcga 480 
cgccgacacc gccttcggca ggctgagcag ccgaatcgat cctcgagcag ctggttcgtt 540 
ctgggagctg accaccgaca cgaatctgcg gtccttcacc gatatcaccg cgcgcctggg 600 
ccgaaattcc gcgggactgt acgtcctggc aggccagccg gcatccggtc cgcgccgggt 660 
gctcgatccg gccatctacc gcgaagccgc cctaaggttg gatcaccatt tcgcaatctc 720 
ggtgatcgac tgcggttcct ccatggaggc ggcggtcacc caggaagtat tgcgcgatgt 780 
ggatgctctg atcgtggtgt cctcgccctg ggcggatggt gcctccgctg ccgccaacac 840 
catcgaatgg ctgtcggatt atggcctgac aggtttgttg cgacgcagca tcgtggtgct 900 
caacgattcg gacggacacg ccgacaagcg caccaagtca ttgctggccc aggaattcat 960 
cgaccacggg cagcctgtgg tcgaggtgcc cttcgatccc catttgcggc ccgggggggt 1020
catcgatatg agccacgaaa tggccccgac gacgcggctg aaaatcctgc aggtcgccgc 1080 
gacggtgacg gcgtacttcg cgtcgcgacc cgccgacgca cacggcagcc cgccccggtg 1140 
acctggctgg ctgacccggt cggcaacagc aggatcgccc gagcgcaggc ctgcaaaacg 1200
tcaatctcgg cgcccatcgt cgaatcctgg cgggcgcaac gcggcgcgca atgtggacag 1260 
cgcgagaaat cttgtcgatg ttctcgcgct gtccacatcc agggcatctc accgccactg 1320 
ttccgcagac ccctcgaacc agcggtccag gcggcggttg cgtcatgccg attgggcaga 1380
cacccggtgg tcgcgcaccg ggtaaccgtt gcgctcggcc agggatcgca gctggcccaa 1440
cgcgaatgcc cgcgcccggc ctgattcggg aattacgacc cctgcccaca gcccttccgc 1500 
acccgcggac tcgacggcgt cgcgtgcaca cagccaccgg cgcgggcaag cccggcacag 1560 
ggtcttggcc tcgtcgtcgg gagtcgtcgt ccaacgatcg ggatcttgcg tgcaaacgcc 1620 
gagcgggacc tcatacaggg cggttactgt catgtctacg ttcctccaga aagcgttgca 1680 
ggttgtagcc tctgccgcga aagcgtatcg cattaaccat agcgatgcaa cagtttcctc 1740 
ctctgcctgc ctagcggtgc tgcggctccg gttcggcgag ctccgagctc tagtgcgcgc 1800 
accgccgagt accagggcat agatcctgtt aatcagctgt gtatctggcc tcgccggcgc 1860 
gtatccgacc ccttcgggca gatcttccag gaaaagtgtt ctgacatgcg acagttcagg 1920 
tgtgaagtga actgtagcgg cagttcggtt tggctaggaa actatttcca tagcgggccg 1980 
tcgcgtcgct agatccaaaa tgtagcgaag tcatagcagt agaagggtgc aacggttagg 2040 
atggcgggcg agcggaaagt ctgcccaccg tcccggctag tacccgcgaa taagggatca 2100
acgcagatgt ctaaagcagg gtcgactgtc ggaccggcgc cgctggtcgc gtgcagcggc 2160
ggcacatcag acgtgattga gccccgtcgc ggtgtcgcga tcattggcca ctcgtgccga 2220
gtcggcaccc agatcgacga ttctcgaatc tctcagacac atctgcgagc ggtatccgat 2280
gatggacggt ggcggatcgt cggcaacatc ccgagaggta tgttcgtcgg cggacgacgc 2340
ggcagctcgg tgaccgtcag cgataagacc ctaatccgat tcggcgatcc ccctggaggc 2400
aaggcgttga cgttcgaagt cgtcaggccg tcggattccg ctgcacagca cggccgcgta 2460
caaccatcag cggacctgtc ggacgacccg gcgcacaacg ctgcgccggt cgcaccggac 2520
cccggcgtgg ttcgcgcagg ggcggccgcg gctgcgcgcc gtcgtgaact tgacatcagc 2580
caacgcagct tggcggccga cgggatcatc aacgcgggcg cgctcatcgc gttcgagaaa 2640
ggccgtagtt ggccccggga acggacccgg gcaaaactcg aagaagtgct gcagtggccc 2700
gctggaacca tcgcgcgaat ccgtcggggc gagcccaccg agcccgcaac aaaccccgac 2760
gcgtcccccg gactccggcc tgccgacggc ccggcgtcct tgatcgcgca ggctgtcacc 2820
gccgccgtag acggctgcag tctggctatc gcagcgttgc cggcgaccga ggaccccgag 2880
ttcaccgaac gtgccgcgcc gatccttgct gatttgcgcc agctcgaggc gattgccgtc 2940
caagcaaccc gcatcagccg gattaccccg gaattgatca aggcgttggg cgcggtacgt 3000
cgccaccacg acgaattaat gaggctggga gcaaccgccc ctggtgccac actggcgcag 3060
cgcttatatg ccgcacggcg gcgcgcgaac ctttccaccc tggagactgc ccaagcggcc 3120
ggcgtcgcag aagaaatgat cgtcggcgcc gaagccgagg aagagttgcc agccgaggcc 3180
accgaagcga tcgaagcact gatccgtcag atcaattgag gtcggctccg agcgtcccac 3240
aagtacaggc acgccgtaac gctcaagttc aacggtccgg ggaacgcgcg cgttctccgg 3300
cgtttgacgg tgcgttccat cgtgccgcga acttgaaaac gccagcgtca ccaaaaaatt 3360
cgtgcaccaa cccccctccg agcgctgcta agctcaatgt gcagtgcaaa ggtgcagata 3420
atgatggcgc accggaacgg cgagcgtaag gaaacacata aatggcatcg ggtagcggtc 3480
tttgcaagac gacgagtaac tttatttggg gccagttact cttgcttgga gagggaatcc 3540
ccgacccagg cgacattttc aacaccggtt cgtcgctgtt caaacaaatc agcgacaaaa 3600
tgggactcgc cattccgggc accaactgga tcggccaagc ggcggaagct tacctaaacc 3660
agaacatcgc gcaacaactt cgcgcacagg tgatgggcga tctcgacaaa ttaaccggca 3720
acatgatctc gaatcaggcc aaatacgtct ccgatacgcg cgacgtcctg cgggccatga 3780
agaagatgat tgacggtgtc tacaaggttt gtaagggcct cgaaaagatrt ccgctgctcg 3840
gccacttgtg gtcgtgggag ctcgcaatcc ctatgtccgg catcgcgatg gccgttgtcg 3900
gcggcgcatt gctctatcta acgattatga cgctgatgaa tgcgaccaac ctgaggggaa 3960
ttctcggcag gctgatcgag atgttgacga ccttgccaaa gttccccggc ctgcccgggt 4020
tgcccagcct gcccgacatc atcgacggcc tctggccgcc gaagttgccc gacattccga 4080
tccccggcct gcccgacatc ccgggcctac ccgacttcaa atggccgccc acccccggca 4140
gcccgttgtt ccccgacctc ccgtcgttcc cagggttccc cgggttcccg gagttccccg 4200
ccatccccgg gttccccgca ctgcccgggt tgcccagcat tcccaacttg ttccccggct 4260
tgccgggtct gggcgacctg ctgcccggcg taggcgattt gggcaagtta cccacctgga 4320
ctgagctggc cgctttgcct gacttcttgg gcggcttcgc cggcctgccc agcttgggtt 4380
ttggcaatct gctcagcttt gccagtttgc ccaccgtggg tcaggtgacc gccaccatgg 4440
gtcagctgca acagctcgtg gcggccggcg gtggccccag ccaactggcc agcatgggca 4500
gccaacaagc gcaactgatc tcgtcgcagg cccagcaagg aggccagcag cacgccaccc 4560
tcgtgagcga caagaaggaa gacgaggaag gcgtggccga ggcggagcgt gcacccatcg 4620
acgctggcac cgcggccagc caacgggggc aggaggggac cgtcctttga tcggacaccg 4680
agtcgccagc aggtctgtgc catagcgagt cgaagccata gcgagtagaa agttaaacgt 4740
agaggagggt tcaacccatg accggatttc tcggtgtcgt gccttcgttc ctgaaggtgc 4800
tggcgggcat gcacaacgag atcgtgggtg atatcaaaag ggcgaccgat acggtcgccg 4860
ggattagcgg acgagttcag cttacccatg gttcgttcac gtcgaaattc aatgacacgc 4920
tgcaagagtt tgagaccacc cgtagcagca cgggcacggg tttgcaggga gtcaccagcg 4980
gactggccaa taatctgctc gcagccgccg gcgcctaccc caaggccgac gatggcctag 5040
ccggtgttat cgacaagatt ttcggttgat catgacgggt ccgtccgctg caggccgcgc 5100
gggcaccgcc gacaacgtgg tcggcgtcga ggtaaccatc gacggcatgt tggtgatcgc 5160
cgatcggtta cacctggttg atttccctgt cacgcttggg attcggccga atatcccgca 5220
agaggatctg cgagacatcg tctgggaaca ggtgcagcgt gacctcacag cgcaaggggt 5280
gctcgacctc cacggggagc cccaaccgac ggtcgcggag atggtcgaaa ccctgggcag 5340
gccagatcgg accttggagg gtcgctggtg gcggcgcgac attggcggcg tcatggtgcg 5400
cttcgtcgtg tgccgcaggg gcgaccgcca tgtgatcgcg gcgcgcgacg gcgacatgct 5460
ggtgctgcag ttggtggcgc cgcaggtcgg cttggcgggc atggtgacag cggtgctggg 5520
gcccgccgaa cccgccaacg tcgaacccct gacgggtgtg gcaaccgagc tagccgaatg 5580
cacaaccgcg tcccaattga cgcaatacgg tatcgcaccg gcctcggccc gcgtctatgc 5640
cgagatcgtg ggtaacccga ccggctgggt ggagatcgtt gccagccaac gccaccccgg 5700
cggcaccacg acgcagaccg acgccgccgc tggcgtcctg gactccaagc tcggtaggct 5760
ggtgtcgctt ccccgccgtg ttggaggcga cctgtacgga agcttcctgc ccggcactca 5820
gcagaacttg gagcgtgcgc tggacggctt gctagagctg ctccctgcgg gcgcttggct 5880
agatcacacc tcagatcacg cacaagcctc ctcccgaggc tgacccctca catctccgct 5940
acgacttcag aaagggacgc catggtggac ccgccgggca acgacgacga ccacggtgat 6000
ctcgacgccc tcgatttctc cgccgcccac accaacgagg cgtcgccgct ggacgcctta 6060
gacgactatg cgccggtgca gaccgatgac gccgaaggcg acctggacgc cctccatgcg 6120
ctcaccgaac gcgacgagga gccggagctg gagttgttca cggtgaccaa ccctcaaggg 6180
tcggtgtcgg tctcaaccct gatggacggc agaatccagc acgtcgagct gacggacaag 6240
gcgaccagca tgtccgaagc gcagctggcc gacgagatct tcgttattgc cgatctggcc 6300
cgccaaaagg cgcgggcgtc gcagtacacg ttcatggtgg agaacatcgg tgaactgacc 6360
gacgaagacg cagaaggcag cgccctgctg cgggaattcg tggggatgac cctgaatctg 6420
ccgacgccgg aagaggctgc cgcagccgaa gccgaagtgt tcgccacccg ctacgatgtc 6480
gactacacct cccggtacaa ggccgatgac tgatcgcttg gccagtctgt tcgaaagcgc 6540
cgtcagcatg ttgccgatgt cggaggcgcg gtcgctagat ctgttcaccg agatcaccaa 6600
ctacgacgaa tccgcttgcg acgcatggat cggccggatc cggtgtgggg acaccgaccg 6660
ggtgacgctg tttcgcgcct ggtattcgcg ccgcaatttc ggacagttgt cgggatcggt 6720
ccagatctcg atgagcacgt taaacgccag gattgccatc ggggggctgt acggcgatat 6780
cacctacccg gtcacctcgc cgctagcgat caccatgggc tttgccgcat gcgaggcagc 6840
gcaaggcaat tacgccgacg ccatggaggc cttagaggcc gccccggtcg cgggttccga 6900
gcacctggtg gcgtggatga aggcggttgt ctacggcgcg gccgaacgct ggaccgacgt 6960
gatcgaccag gtcaagagtg ctgggaaatg gccggacaag tttttggccg gcgcggccgg 7020
tgtggcgcac ggggttgccg cggcaaacct ggccttgttc accgaagccg aacgccgact 7080
caccgaggcc aacgactcgc ccgccggtga ggcgtgtgcg cgcgccatcg cctggtatct 7140
ggcgatggca cggcgcagcc agggcaacga aagcgccgcg gtggcgctgc tggaatggtt 7200
acagaccact caccccgagc ccaaagtggc tgcggcgctg aaggatccct cctaccggct 7260
gaagacgacc accgccgaac agatcgcatc ccgcgccgat ccctgggatc cgggcagtgt 7320
cgtgaccgac aactccggcc gggagcggct gctcgccgag gcccaagccg aactcgaccg 7380
ccaaattggg ctcacccggg ttaaaaatca gattgaacgc taccgcgcgg cgacgctgat 7440
ggcccgggtc cgcgccgcca agggtatgaa ggtcgcccag cccagcaagc acatgatctt 7500
caccggaccg cccggtaccg gcaagaccac gatcgcgcgg gtggtggcca atatcctggc 7560
cggcttaggc gtcattgccg aacccaaact cgtcgagacg tcgcgcaagg acttcgtcgc 7620
cgagtacgag gggcaatcgg cggtcaagac cgctaagacg atcgatcagg cgctgggcgg 7680
ggtgcttttc atcgacgagg cttatgcgct ggtgcaggaa agagacggcc gcaccgatcc 7740
gttcggtcaa gaggcgctgg acacgctgct ggcgcggatg gagaacgacc gggaccggct 7800
ggtggtgatc atcgccgggt acagctccga catagatcgg ctgctggaaa ccaacgaggg 7860
tctgcggtcg cggttcgcca ctcgcatcga gttcgacacc tattcccccg aggaactcct 7920
cgagatcgcc aacgtcattg ccgctgctga tgattcggcg ttgaccgcag aggcggccga 7980
gaactttctt caggccgcca agcagttgga gcagcgcatg ttgcgcggcc ggcgcgccct 8040
ggacgtcgcc ggcaacggtc ggtatgcgcg ccagctggtg gaggccagcg agcaatgccg 8100
ggacatgcgt ctagcccagg tcctcgatat cgacaccctc gacgaagacc ggcttcgcga 8160
gatcaacggc tcagatatgg cggaggctat cgccgcggtg cacgcacacc tcaacatgag 8220
agaatgaact atggggcttc gcctcaccac caaggttcag gttagcggct ggcgttttct 8280
gctgcgccgg ctcgaacacg ccatcgtgcg ccgggacacc cggatgtttg acgacccgct 8340
gcagttctac agccgctcga tcgctcttgg catcgtcgtc gcggtcctga ttctggcggg 8400
tgccgcgctg ctggcgtact tcaaaccaca aggcaaactc ggcggcacca gcctgttcac 8460
cgaccgcgcg accaaccagc tttacgtgct gctgtccgga cagttgcatc cggtctacaa 8520
cctgacttcg gcgcggctgg tgctgggcaa tccggccaac ccggccaccg tgaagtcctc 8580
cgaactgagc aagctgccga tgggccagac cgttggaatc cccggcgccc cctacgccac 8640
gcctgtttcg gcgggcagca cctcgatctg gaccctatgc gacaccgtcg cccgagccga 8700
ctccacttcc ccggtagtgc agaccgcggt catcgcgatg ccgttggaga tcgatgcttc 8760
gatcgatccg ctccagtcac acgaagcggt gctggtgtcc taccagggcg aaacctggat 8820
cgtcacaact aagggacgcc acgccataga tctgaccgac cgcgccctca cctcgtcgat 8880
ggggataccg gtgacggcca ggccaacccc gatctcggag ggcatgttca acgcgctgcc 8940
tgatatgggg ccctggcagc tgccgccgat accggcggcg ggcgcgccca attcgcttgg 9000
cctacctgat gatctagtga tcggatcggt cttccagatc cacaccgaca agggcccgca 9060
atactatgtg gtgctgcccg acggcatcgc gcaggtcaac gcgacaaccg ctgcggcgct 9120
gcgcgccacc caggcgcacg ggctggtcgc gccaccggca atggtgccca gtctggtcgt 9180
cagaatcgcc gaacgggtat acccctcacc gctacccgat gaaccgctca agatcgtgtc 9240
ccggccgcag gatcccgcgc tgtgctggtc atggcaacgc agcgccggcg accagtcgcc 9300
gcagtcaacg gtgctgtccg gccggcatct gccgatatcg ccctcagcga tgaacatggg 9360
gatcaagcag atccacggga cggcgaccgt ttacctcgac ggcggaaaat tcgtggcact 9420
gcaatccccc gatcctcgat acaccgaatc gatgtactac atcgatccac agggcgtgcg 9480 
ttatggggtg cctaacgcgg agacagccaa gtcgctgggc ctgagttcac cccaaaacgc 9540 
gccctgggag atcgttcgtc tcctggtcga cggtccggtg ctgtcgaaag atgccgcact 9600
gctcgagcac gacacgctgc ccgctgaccc tagcccccga aaagttcccg ccggagcctc 9660 
cggagccccc tgatgacgac caagaagttc actcccacca ttacccgtgg cccccggttg 9720 
accccgggcg agatcagcct cacgccgccc gatgacctgg gcatcgacat cccaccgtcg 9780 
ggcgtccaaa agatccttcc ctacgtgatg ggtggcgcca tgctcggcat gatcgccatc 9840 
atggtggccg gcggcaccag gcagctgtcg ccgtacatgt tgatgatgcc gctgatgatg 9900
atcgtgatga tggtcggcgg tctggccggt agcaccggtg gtggcggcaa gaaggtgccc 9960 
gaaatcaacg ccgaccgcaa ggagtacctg cggtatttgg caggactacg cacccgagtg 10020 
acgtcctcgg ccacctctca ggtggcgttc ttctcctacc acgcaccgca tcccgaggat 10080 
ctgttgtcga tcgtcggcac ccaacggcag tggtcccggc cggccaacgc cgacttctat 10140
gcggccaccc gaatcggtat cggtgaccag ccggcggtgg atcgattatt gaagccggcc 10200
gtcggcgggg agttggccgc cgccagcgca gcacctcagc cgttcctgga gccggtcagt 10260 
catatgtggg tggtcaagtt tctacgaacc catggattga tccatgactg cccgaaactg 10320 
ctgcaactcc gtacctttcc gactatcgcg atcggcgggg acttggcggg ggcagccggc 10380
ctgatgacgg cgatgatctg tcacctagcc gtgttccacc caccggacct gctgcagatc 10440
cgggtgctca ccgaggaacc cgacgacccc gactggtcct ggctcaaatg gcttccgcac 10500 
gtacagcacc agaccgaaac cgatgcggcc gggtccaccc ggctgatctt cacgcgccag 10560 
gaaggtctgt cggacctggc cgcgcgcggg ccacacgcac ccgattcgct tcccggcggc 10620
ccctacgtag tcgtcgtcga cctgaccggc ggcaaggctg gattcccgcc cgacggtagg 10680 
gccggtgtca cggtgatcac gttgggcaac catcgcggct cggcctaccg catcagggtg 10740
cacgaggatg ggacggctga tgaccggctc cctaaccaat cgtttcgcca ggtgacatcg 10800
gtcaccgatc ggatgtcgcc gcagcaagcc agccgtatcg cgcgaaagtt ggccggatgg 10860 
tccatcacgg gcaccatcct cgacaagacg tcgcgggtcc agaagaaggt ggccaccgac 10920
tggcaccagc tggtcggtgc gcaaagtgtc gaggagataa caccttcccg ctggaggatg 10980 
tacaccgaca ccgaccgtga ccggctaaag atcccgtttg gtcatgaact aaagaccggc 11040 
aacgtcatgt acctggacat caaagagggc gcggaattcg gcgccggacc gcacggcatg 11100 
ctcatcggga ccacggggtc tgggaagtcc gaattcctgc gcaccctgat cctgtcgctg 11160 
gtggcaatga ctcatccaga tcaggtgaat ctcctgctca ccgacttcaa aggtggttca 11220
accttcctgg gaatggaaaa gcttccgcac actgccgctg tcgtcaccaa catggccgag 11280 
gaagccgagc tcgtcagccg gatgggcgag gtgttgaccg gagaactcga tcggcgccag 11340 
tcgatcctcc gacaggccgg gatgaaagtc ggcgcggccg gagccctgtc cggcgtggcc 11400 
gaatacgaga agtaccgcga acgcggtgcc gacctacccc cgctgccaac gcttttcgtc 11460 
gtcgtcgacg agttcgccga gctgttgcag agtcacccgg acttcatcgg gctgttcgac 11520
cggatctgcc gcgtcgggcg gtcgctgagg gtccatctgc tgctggctac ccagtcgctg 11580 
cagaccggcg gtgttcgcat cgacakactg gagccaaacc tgacatatcg aatcgcattg 11640
cgcaccacca gctctcatga atccaaggcg gtaatcggca caccggaggc gcagtacatc 11700 
accaacaagg agagcggtgt cgggtttctc cgggtcggca tggaagaccc ggtcaagttc 11760 
agcaccttct acatcagtgg gccatacatg ccgccggcgg caggcgtcga aaccaatggt 11820
gaagccggag ggcccggtca acagaccact agacaagccg cgcgcattca caggttcacc 11880
gcggcaccgg ttctcgagga ggcgccgaca ccgtgacccg cgccggcgac gatgcaaagc 11940 
gcagcgatga ggaggagcgg cgccaacggc ccgcgccggc gacgatgcaa agcgcagcga 12000 
tgaggaggag cggcgcgcat gactgctgaa ccggaagtac ggacgctgcg cgaggttgtg 12060
ctggaccagc tcggcactgc tgaatcgcgt gcgtacaaga tgtggctgcc gccgttgacc 12120 
aatccggtcc cgctcaacga gctcatcgcc cgtgatcggc gacaacccct gcgatttgcc 12180 
ctggggatca tggatgaacc gcgccgccat ctacaggatg tgtggggcgt agacgtttcc 12240 
ggggccggcg gcaacatcgg tattgggggc gcacctcaaa ccgggaagtc gacgctactg 12300 
cagacgatgg tgatgtcggc cgccgccaca cactcaccgc gcaacgttca gttctattgc 12360 
atcgacctag gtggcggcgg gctgatctat ctcgaaaacc ttccacacgt cggtggggta 12420
gccaatcggt ccgagcccga caaggtcaac cgggtggtcg cagagatgca agccgtcatg 12480
cggcaacggg aaaccacctt caaggaacac cgagtgggct cgatcgggat gtaccggcag 12540 
ctgcgtgacg atccaagtca acccgttgcg tccgatccat acggcgacgt ctttctgatc 12600 
atcgacggat ggcccggttt tgtcggcgag ttccccgacc ttgaggggca ggttcaagat 12660 
ctggccgccc aggggctggc gttcggcgtc cacgtcatca tctccacgcc acgctggaca 12720 
gagctgaagt cgcgtgttcg cgactacctc ggcaccaaga tcgagttccg gcttggtgac 12780
gtcaatgaaa cccagatcga ccggattacc cgcgagatcc cggcgaatcg tccgggtcgg 12840
gcagtgtcga tggaaaagca ccatctgatg atcggcgtgc ccaggttcga cggcgtgcac 12900 
agcgccgata acctggtgga ggcgatcacc gcgggggtga cgcagatcgc ttcccagcac 12960 
accgaacagg cacctccggt gcgggtcctg ccggagcgta tccacctgca cgaactcgac 13020 
ccgaacccgc cgggaccaga gtccgactac cgcactcgct gggagattcc gatcggcttg 13080
cgcgagacgg acctgacgcc ggctcactgc cacatgcaca cgaacccgca cctactgatc 13140 
ttcggtgcgg ccaaatcggg caagacgacc attgcccacg cgatcgcgcg cgccatttgt 13200 
gcccgaaaca gtccccagca ggtgcggttc atgctcgcgg actaccgctc gggcctgctg 13260 
gacgcggtgc cggacaccca tctgctgggc gccggcgcga tcaaccgcaa cagcgcgtcg 13320
ctagacgagg ccgttcaagc actggcggtc aacctgaaga agcggttgcc gccgaccgac 13380 
ctgacgacgg cgcagctacg ctcgcgttcg tggtggagcg gatttgacgt cgtgcttctg 13440 
gtcgacgatt ggcacatgat cgtgggtgcc gccgggggga tgccgccgat ggcaccgctg 13500 
gccccgttat tgccggcggc ggcagatatc gggttgcaca tcattgtcac ctgtcagatg 13560 
agccaggctt acaaggcaac catggacaag ttcgtcggcg ccgcattcgg gtcgggcgct 13620
ccgacaatgt tcctttcggg cgagaagcag gaattcccat ccagtgagtt caaggtcaag 13680 
cggcgccccc ctggccaggc atttctcgtc tcgccagacg gcaaagaggt catccaggcc 13740 
ccctacatcg agcctccaga agaagtgttc gcagcacccc caagcgccgg ttaagattat 13800 
ttcattgccg gtgtagcagg acccgagctc agcccggtaa tcgagttcgg gcaatgctga 13860 
ccatcgggtt tgtttccggc tataaccgaa cggtttgtgt acgggataca aatacaggga 13920 
gggaagaagt aggcaaatgg aaaaaatgtc acatgatccg atcgctgccg acattggcac 13980 
gcaagtgagc gacaacgctc tgcacggcgt gacggccggc tcgacggcgc tgacgtcggt 14040 
gaccgggctg gttcccgcgg gggccgatga ggtctccgcc caagcggcga cggcgttcac 14100 
atcggagggc atccaattgc tggcttccaa tgcatcggcc caagaccagc tccaccgtgc 14160 
gggcgaagcg gtccaggacg tcgcccgcac ctattcgcaa atcgacgacg gcgccgccgg 14220 
cgtcttcgcc gaataggccc ccaacacatc ggagggagtg atcaccatgc tgtggcacgc 14280
aatgccaccg gagctaaata ccgcacggct gatggccggc gcgggtccgg ctccaatgct 14340
tgcggcggcc gcgggatggc agacgctttc ggcggctctg gacgctcagg ccgtcgagtt 14400
gaccgcgcgc ctgaactctc tgggagaagc ctggactgga ggtggcagcg acaaggcgct 14460 
tgcggctgca acgccgatgg tggtctggct acaaaccgcg tcaacacagg ccaagacccg 14520
tgcgatgcag gcgacggcgc aagccgcggc atacacccag gccatggcca cgacgccgtc 14580 
gctgccggag atcgccgcca accacatcac ccaggccgtc cttacggcca ccaacttctt 14640 
cggtatcaac acgatcccga tcgcgttgac cgagatggat tatttcatcc gtatgtggaa 14700
ccaggcagcc ctggcaatgg aggtctacca ggccgagacc gcggttaaca cgcttttcga 14760
gaagctcgag ccgatggcgt cgatccttga tcccggcgcg agccagagca cgacgaaccc 14820 
gatcttcgga atgccctccc ctggcagctc aacaccggtt ggccagttgc cgccggcggc 14880
tacccagacc ctcggccaac tgggtgagat gagcggcccg atgcagcagc tgacccagcc 14940
gctgcagcag gtgacgtcgt tgttcagcca ggtgggcggc accggcggcg gcaacccagc 15000 
cgacgaggaa gccgcgcaga tgggcctgct cggcaccagt ccgctgtcga accatccgct 15060
ggctggtgga tcaggcccca gcgcgggcgc gggcctgctg cgcgcggagt cgctacctgg 15120 
cgcaggtggg tcgttgaccc gcacgccgct gatgtctcag ctgatcgaaa agccggttgc 15180 
cccctcggtg atgccggcgg ctgctgccgg atcgtcggcg acgggtggcg ccgctccggt 15240
gggtgcggga gcgatgggcc agggtgcgca atccggcggc tccaccaggc cgggtctggt 15300
cgcgccggca ccgctcgcgc aggagcgtga agaagacgac gaggacgact gggacgaaga 15360
ggacgactgg tgagctcccg taatgacaac agacttcccg gccacccggg ccggaagact 15420
tgccaacatt ttggcgagga aggtaaagag agaaagtagt ccagcatggc agagatgaag 15480
accgatgccg ctaccctcgc gcaggaggca ggtaatttcg agcggatctc cggcgacctg 15540
aaaacccaga tcgaccaggt ggagtcgacg gcaggttcgt tgcagggcca gtggcgcggc 15600 
gcggcgggga cggccgccca ggccgcggtg gtgcgcttcc aagaagcagc caataagcag 15660 
aagcaggaac tcgacgagat ctcgacgaat attcgtcagg ccggcgtcca atactcgagg 15720 
gccgacgagg agcagcagca ggcgctgtcc tcgcaaatgg gcttctgacc cgctaatacg 15780 
aaaagaaacg gagcaaaaac atgacagagc agcagtggaa tttcgcgggt atcgaggccg 15840
cggcaagcgc aatccaggga aatgtcacgt ccattcattc cctccttgac gaggggaagc 15900
agtccctgac caagctcgca gcggcctggg gcggtagcgg ttcggaggcg taccagggtg 15960
tccagcaaaa atgggacgcc acggctaccg agctgaacaa cgcgctgcag aacctggcgc 16020 
ggacgatcag cgaagccggt caggcaatgg cttcgaccga aggcaacgtc actgggatgt 16080 
tcgcataggg caacgccgag ttcgcgtaga atagcgaaac acgggatcgg gcgagttcga 16140
ccttccgtcg gtctcgccct ttctcgtgtt tatacgtttg agcgcactct gagaggttgt 16200
catggcggcc gactacgaca agctcttccg gccgcacgaa ggtatggaag ctccggacga 16260 
tatggcagcg cagccgttct tcgaccccag tgcttcgttt ccgccggcgc ccgcatcggc 16320
aaacctaccg aagcccaacg gccagactcc gcccccgacg tccgacgacc tgtcggagcg 16380 
gttcgtgtcg gccccgccgc cgccaccccc acccccacct ccgcctccgc caactccgat 16440 
gccgatcgcc gcaggagagc cgccctcgcc ggaaccggcc gcatctaaac cacccacacc 16500 
ccccatgccc atcgccggac ccgaaccggc cccacccaaa ccacccacac cccccatgcc 16560
catcgccgga cccgaaccgg ccccacccaa accacccaca cctccgatgc ccatcgccgg 16620
acctgcaccc accccaaccg aatcccagtt ggcgcccccc agaccaccga caccacaaac 16680
gccaaccgga gcgccgcagc aaccggaatc accggcgccc cacgtaccct cgcacgggcc 16740
acatcaaccc cggcgcaccg caccagcacc gccctgggca aagatgccaa tcggcgaacc 16800 
cccgcccgct ccgtccagac cgtctgcgtc cccggccgaa ccaccgaccc ggcctgcccc 16860 
ccaacactcc cgacgtgcgc gccggggtca ccgctatcgc acagacaccg aacgaaacgt 16920
cgggaaggta gcaactggtc catccatcca ggcgcggctg cgggcagagg aagcatccgg 16980 
cgcgcagctc gcccccggaa cggagccctc gccagcgccg ttgggccaac cgagatcgta 17040
tctggctccg cccacccgcc ccgcgccgac agaacctccc cccagcccct cgccgcagcg 17100 
caactccggt cggcgtgccg agcgacgcgt ccaccccgat ttagccgccc aacatgccgc 17160 
ggcgcaacct gattcaatta cggccgcaac cactggcggt cgtcgccgca agcgtgcagc 17220 
gccggatctc gacgcgacac agaaatcctt aaggccggcg gccaaggggc cgaaggtgaa 17280
gaaggtgaag ccccagaaac cgaaggccac gaagccgccc aaagtggtgt cgcagcgcgg 17340 
ctggcgacat tgggtgcatg cgttgacgcg aatcaacctg ggcctgtcac ccgacgagaa 17400
gtacgagctg gacctgcacg ctcgagtccg ccgcaatccc cgcgggtcgt atcagatcgc 17460 
cgtcgtcggt ctcaaaggtg gggctggcaa aaccacgctg acagcagcgt tggggtcgac 17520 
gttggctcag gtgcgggccg accggatcct ggctctagac gcggatccag gcgccggaaa 17580
cctcgccgat cgggtagggc gacaatcggg cgcgaccatc gctgatgtgc ttgcagaaaa 17640 
agagctgtcg cactacaacg acatccgcgc acacactagc gtcaatgcgg tcaatctgga 17700
agtgctgccg gcaccggaat acagctcggc gcagcgcgcg ctcagcgacg ccgactggca 17760 
tttcatcgcc gatcctgcgt cgaggtttta caacctcgtc ttggctgatt gtggggccgg 17820 
cttcttcgac ccgctgaccc gcggcgtgct gtccacggtg tccggtgtcg tggtcgtggc 17880 
aagtgtctca atcgacggcg cacaacaggc gtcggtcgcg ttggactggt tgcgcaacaa 17940 
cggttaccaa gatttggcga gccgcgcatg cgtggtcatc aatcacatca tgccgggaga 18000 
acccaatgtc gcagttaaag acctggtgcg gcatttcgaa cagcaagttc aacccggccg 18060 
ggtcgtggtc atgccgtggg acaggcacat tgcggccgga accgagattt cactcgactt 18120 
gctcgaccct atctacaagc gcaaggtcct cgaattggcc gcagcgctat ccgacgattt 18180 
cgagagggct ggacgtcgtt gagcgcacct gctgttgctg ctggtcctac cgccgcgggg 18240 
gcaaccgctg cgcggcctgc caccacccgg gtgacgatcc tgaccggcag acggatgacc 18300
gatttggtac tgccagcggc ggtgccgatg gaaacttata ttgacgacac cgtcgcggtg 18360
ctttccgagg tgttggaaga cacgccggct gatgtactcg gcggcttcga ctttaccgcg 18420 
caaggcgtgt gggcgttcgc tcgtcccgga tcgccgccgc tgaagctcga ccagtcactc 18480
gatgacgccg gggtggtcga cgggtcactg ctgactctgg tgtcagtcag tcgcaccgag 18540
cgctaccgac cgttggtcga ggatgtcatc gacgcgatcg ccgtgcttga cgagtcacct 18600 
gagttcgacc gcacggcatt gaatcgcttt gtgggggcgg cgatcccgct tttgaccgcg 18660 
cccgtcatcg ggatggcgat gcgggcgtgg tgggaaactg ggcgtagctt gtggtggccg 18720 
ttggcgattg gcatcctggg gatcgctgtg ctggtaggca gcttcgtcgc gaacaggttc 18780
taccagagcg gccacctggc cgagtgccta ctggtcacga cgtatctgct gatcgcaacc 18840
gccgcagcgc tggccgtgcc gctgccgcgc ggggtcaact cgttgggggc gccacaagtt 18900
gccggcgccg ctacggccgt gctgtttttg accttgatga cgcggggcgg ccctcggaag 18960 
cgtcatgagt tggcgtcgtt tgccgtgatc accgctatcg cggtcatcgc ggccgccgct 19020
gccttcggct atggatacca ggactgggtc cccgcggggg ggatcgcatt cgggctgttc 19080 
attgtgacga atgcggccaa gctgaccgtc gcggtcgcgc ggatcgcgct gccgccgatt 19140
ccggtacccg gcgaaaccgt ggacaacgag gagttgctcg atcccgtcgc gaccccggag 19200
gctaccagcg aagaaacccc gacctggcag gccatcatcg cgtcggtgcc cgcgtccgcg 19260
gtccggctca ccgagcgcag caaactggcc aagcaacttc tgatcggata cgtcacgtcg 19320 
ggcaccctga ttctggctgc cggtgccatc gcggtcgtgg tgcgcgggca cttctttgta 19380
cacagcctgg tggtcgcggg tttgatcacg accgtctgcg gatttcgctc gcggctttac 19440
gccgagcgct ggtgtgcgtg ggcgttgctg gcggcgacgg tcgcgattcc gacgggtctg 19500
acggccaaac tcatcatctg gtacccgcac tatgcctggc tgttgttgag cgtctacctc 19560 
acggtagccc tggttgcgct cgtggtggtc gggtcgatgg ctcacgtccg gcgcgtttca 19620 
ccggtcgtaa aacgaactct ggaattgatc gacggcgcca tgatcgctgc catcattccc 19680 
atgctgctgt ggatcaccgg ggtgtacgac acggtccgca atatccggtt ctgagccgga 19740
tcggctgatt ggcggttcct gacagaacat cgaggacacg gcgcaggttt gcataccttc 19800
ggcgcccgac aaattgctgc gattgagcgt gtggcgcgtc cggtaaaatt tgctcgatgg 19860 
ggaacacgta taggagatcc ggcaatggct gaaccgttgg ccgtcgatcc caccggcttg 19920 
agcgcagcgg ccgcgaaatt ggccggcctc gtttttccgc agcctccggc gccgatcgcg 19980 
gtcagcggaa cggattcggt ggtagcagca atcaacgaga ccatgccaag catcgaatcg 20040
ctggtcagtg acgggctgcc cggcgtgaaa gccgccctga ctcgaacagc atccaacatg 20100
aacgcggcgg cggacgtcta tgcgaagacc gatcagtcac tgggaaccag tttgagccag 20160 
tatgcattcg gctcgtcggg cgaaggcctg gctggcgtcg cctcggtcgg tggtcagcca 20220 
agtcaggcta cccagctgct gagcacaccc gtgtcacagg tcacgaccca gctcggcgag 20280 
acggccgctg agctggcacc ccgtgttgtt gcgacggtgc cgcaactcgt tcagctggct 20340 
ccgcacgccg ttcagatgtc gcaaaacgca tcccccatcg ctcagacgat cagtcaaacc 20400
gcccaacagg ccgcccagag cgcgcagggc ggcagcggcc caatgcccgc acagcttgcc 20460
agcgctgaaa aaccggccac cgagcaagcg gagccggtcc acgaagtgac aaacgacgat 20520
cagggcgacc agggcgacgt gcagccggcc gaggtcgttg ccgcggcacg tgacgaaggc 20580
gccggcgcat caccgggcca gcagcccggc gggggcgttc ccgcgcaagc catggatacc 20640
ggagccggtg cccgcccagc ggcgagtccg ctggcggccc ccgtcgatcc gtcgactccg 20700
gcaccctcaa caaccacaac gttgtagacc gggcctgcca gcggctccgt ctcgcacgca 20760
gcgcctgttg ctgtcctggc ctcgtcagca tgcggcggcc agggcccggt cgagcaaccc 20820
ggtgacgtat tgccagtaca gccagtccgc gacggccaca cgctggacgg ccgcgtcagt 20880
cgcagtgtgc gcttggtgca gggcaatctc ctgtgagtgg gcagcgtagg cccggaacgc 20940 
ccgcagatga gcggcctcgc ggccggtagc ggtgctggtc atgggcttca tcagctcgaa 21000 
ccacagcatg tgccgctcat cgcccggtgg attgacatcc accggcgccg gcggcaacaa 21060 
gtcgagcaaa cgctgatcgg tagtgtcggc cagctgagcc gccgccgagg ggtcgacgac 21120
ctccagccgc gaccggcccg tcattttgcc gctctccgga atgtcatctg gctccagcac 21180 
aatcttggcc acaccgggat ccgaactggc caactgctcc gcggtaccga tcaccgcccg 21240 
cagcgtcatg tcgtggaaag ccgcccaggc ttgcacggcc aaaaccgggt aggtggcaca 21300 
gcgtgcaatt tcgtcaaccg ggattgcgtg atccgcgctg gccaagtaca ccttattcgg 21360 
caattccatc ccgtcgggta tgtaggccag cccatagctg ttggccacga cgatggaacc 21420
gtcggtggtc accgcggtga tccagaagaa cccgtagtcg cccgcgttgt tgtcggacgc 21480 
gttgagcgcc gccgcgatgc gtcgcgccaa ccgcagcgca tcaccgcggc cacgctggcg 21540 
ggcgctggca gctgcagtgg cggcgtcgcg tgccgcccga gccgccgaca ccgggatcat 21600 
cgacaccggc gtaccgtcat ctgcagactc gctgcgatcg ggtttgtcga tgtgatcggt 21660 
cgacggcggg cgggcaggag gtgccgtccg cgccgaggcc gcccgcgtgc tcggtgccgc 21720
cgccttgtcc gaggtagcca ccggcgcccg cccagtggca gcatgcgacc ccgcgcccga 21780
ggccgcggcc gtacccacgc tcgaacgcgc gcccgctccc acggcggtac cgctcggcgc 21840
ggcggccgcc gcccgtgcgc ccgggacacc ggacgccgca gccggcgtca ccgacgcggc 21900 
ggattcgtcc gcatgggcag gccccgactg cgtccccccg cccgcatgct ggcccggcac 21960
accaggttgc tccgccaacg ccgcgggttt gacgtgcggc gccggctcgc cccctggggt 22020
gcccggtgtt gctggaccag acggaccggg agtggccggt gtaaccggct ggggcccagg 22080
cgatggcgcc ggtgccggag ccggctgcgg gtgtggagcg ggagctgggg taacgggcgt 22140
ggccggggtt gccggtgtgg ccggggcgac cgggggggtg accggcgtga tcggggttgg 22200
ctcgcctggt gtgcccggtt tgaccggggt caccggggtg accggcttgc ccggggtcac 22260 
cggcgtgacg ggagtgccgg gcgttggtgt gatcggagtt accggcgctc ccgggatggg 22320
tgtgattggg gttcccgggg tgatcggggt tcccggggtg atcggggttc ccggtgtgcc 22380
cggtgtgccc ggggatggca cgaccagggt aggcacgtct gggggtggcg gcgacttctg 22440 
ctgaagcaaa tcctcgagtg cgttcttcgg aggtttccaa ttcttggatt ccagcacccg 22500 
ctcagcggtc tcggcgacca gactgacatt ggccccatgc gtcgccgtga ccaatgaatt 22560 
gatggcggta tggcgctcat cagcatccag gctagggtca ttctccagga tatcgatctc 22620 
ccgttgagcg ccatccacat tattgccgat atcggattta gcttgctcaa tcaacccggc 22680
aatatgcctg tgccaggtaa tcaccgtggc gagataatcc tgcagcgtca tcaattgatt 22740
gatgtttgca cccagggcgc cgttggcagc attggcggcg ccgccggacc ataggccgcc 22800
ttcgaagacg tggcctttct gctggcggca ggtgtccaat acatcggtga ccctttgcaa 22860
aacctggcta tattcctggg cccggtcata gaaagtgtct tcatcggctt ccacccagcc 22920 
gcccggatcc agcatctgtc tggcatagct gcccgtcggc ctggtaatac tcatccccta 22980
ctgccctccc caaaccgcca gatcgcctcg cggatcaccg tccggttggc ctccggcatt 23040
tcacgccggc tcggccgctg gatccacccc gcgccggtat tcgcagtaac ccgttgaatc 23100 
cgcgcgcatg atgcaccgct tgggcgatca gccgggtggt cacctcgctt gcgctggccg 23160 
cgctgtcgca cggggcgctc ggtggtaacg gacgtcataa ttaaccagcg taaccgaacc 23220 
taagaccagc tagctgcggc aatattggcg accaggacta tggcgccctc cgaacccggc 23280 
cgatccatgt caaaacattg acaatgcgta ctcacgccgt gtcgggcgcg ctgaatgacc 23340 
gcattgcggc gctcattcgg tgcgtagtcg ctaccaccgc aacaatgggc ttaggccatt 23400 
ccttcgttca tcgcgcggga catggccgat aacgcagcgg tcagctgctc gcccgccgcg 23460
tcgttatacg cggacgccgc ggcctgcgca ttgtgcagcg cctcgttgac ccgctgagcp 23520 
accgcctcgg cacccagctt cttcagcaaa ccatcttcga tgcgcaggcc ggtgagccac 23580 
tggtgcccat tgatcgtcac ttcgacggtc tcggcttcgt cggtggcgcg gaaggatccg 23640
ttgttcatct gattgagcgt cccgtctagg gccgactgaa accgcgccgc cagcgtcaac 23700 
gcccgggcga catgcgggtc caattcgtcc atgctcactt cgactcctta ctgtcctggc 23760 
gccgacggtt accaatgacg gcctcggtcc atgcccgatc ctcggtgtag agcgcctcgt 23820
cttcctgctg agaacccttg gacttggcgc ccccttgtcc ctgatgcgcg gcacccatcg 23880
gcattcccat gccaccgccg cccagcgcgg cgccgccgcc ggcccttccc tggcctaagc 23940
cggcaatgtc accagcgcca gcgggccgca ccgattcggc gcccccgatc gcggatccca 24000 
acggcgccga cggcaccccg ccgcctccac

cgtcgcccga cagcgctgcg gcttcccgcc 

ccgggaggcc accacccggc gatccggtag 

taccgggagt cacaccggag ccgtcagacg 

gctcttgcgg aggcgggggc gggtcgatct 

ccggttccag ggctgccttg ttgttgtatt 

actccgcgta caccgggaga atttggtcgc 

cgagcccgac tatgtcttca taagtcggat 

cgacatattg agcctgcttg gccatcgcag 

gccgttgttg atcgagcgaa gcctcgcaag 

acccccggaa ccgcttgacg tcgccttgca 

caaagtgcgc gagcgatgcg ccttggtcgc 

gatccatgaa gttgggttca ccggccgtgg 

aactgtcccc tccgacggcc ccggccgatt 

ccagcgcggt cgcagcctcc tcatcaacct 

aggtcgccag acgctgccgc tctttggcac 

cggacaatac cagctgttgg gcggcgtttt 

ggacatcagt cggtgggtcc gccatcgggg 

cttgctgatc caccgtcacg gtctgcgact 

atagccatta tcgtcgctaa actgaaaggt 

gccggcatcg cgaacggatc gccctacttc 

ratggggaaa cccggcgaat cggcagctgc 

cgcagaatca cccggtcaat accgggatgc 

gccaccttta cctcctgccc gatgtgtgcg 

ggcccgactg tgaccatcgc acctgccgcc 

tgagcgtcac ccaggttgag ggcgatgtcg 

cggttcaccg tctcgccgac cagtacccca 

ttggccacca gatgctggcc ttgcagcgcg 

cccctgggtg tcttgggctt gtccgccgtg 

cccggcgcga cccggactct ggtgatggtg 

ggacctccgg gcgccgcgta ggcggcagtg 

ttctcccgca caaagccgat gtcggtggcg 

accgcgtcga caccatggct agccagtcga 

taccgggcca gtcctgcgac gccttcatcg 

gggtcggcgc gcagcacgat ccaggtccgg 

acctgctgat acagactcac cacgtccggc 

gagacgatat cggcctccaa gtcgggacag 

gtgtccagca tgtcgtcggt gtgggcttgc 

cggggaatga gctcgattac ggcgaccaga 

tctcctggct tcacggtggc cccgaccaca 

cgccgccgca accacgcgta caccgccgcc 

gtgaccgtgg ccacgatgac gcccaacgag 

gtctccaaga atgcgatgat gcatggcggg 

gtgctgaacc gcagccctaa aggatttctc 

cgcccaggcc cagggccaac gtaaggccga 

gacgatcggg acccggctcc accaccgggg 

aagcagggcc gggcggaatg tcccacgtca 

cgaccaggtc gtcgaccccg cccccggggt 

tctgcgccgg cgtcaggtcg gggaaccgct 

atgccgcggc aaacgaggtg ccggcgatgg 

tcaccggttc accggtgtcg ccgagcgcga 

cccacggtcc gtgcatcgag aacgagctgg 

ttaacaccag cggtgcgtac cacgccgggg 

gtgggtcgcc gggtgtggac gggtccggcg 

tgccggccgc gaccaccacc accacgcctt 

ccagtgaggt ttcatcgatc ggcctgctca 

tcacacccac gccgaggttg gcggcgtgca 

cggcggccgg ggtggcgttg gggtcattcg 

cctcagacgt ctgacgtagc gagagcagtc 

cggtgggcgc gggccggccc gcgatgatgg 

cagacaggcc gttaccggcc tggtcgacga 



cgccaccgag cgatgccgct ttgaccgcca 24060 
cagccgacgt cagctgcgcc gccgtgtcag 24120 
gcggaaccat cggtgcggct ggcatcccgg 24180 
gcggcatcag gaagccaggg atcaatccct 24240
tgatggcggg gggaggcttc ggcgggttta 24300 
cggtcagcac cttctccgac ctctgctgat 24360 
gggccgaagg gttttccgcg taaagccgtt 24420 
gttcccgcct agcccacacg tgcagctgcg 24480
cgctcaattt ggccatgtgg agtatccatt 24540 
cggtagccgc atcgccttcc cagttgtcaa 24600 
gcgtcaggtt gaaagtgttc cacccatccg 24660 
ccgtttcgag cttccttgcc gcttctttga 24720 
ccaccctcgg cgtatcggtt agttcggccg 24780 
ctgcctgcac agttccttcg ccgtcgttgt 24840
cgccatacgc cttggccgcg ttgcgcagcg 24900 
cggccgccag gtattcccgc atgttgtcgg 24960 
tagccgccgt gagttcgcac ggtgtgatgg 25020 
cctccacctc gttggccctg ttcaaaatct 25080 
gcgtcatatc ggatcatcct ccttagtgct 25140 
tcctgcacta atttgatgcc gcccgttcat 25200 
ggcagcgcca tctggtagcg gctttcctcg 25260 
cgatgccgcg gggtaccgat cacattgtgc 25320 
gggccgagat aggtcgtcgc attcggccac 25380 
ccgatcaacc gggcaaattc ctcgaactgt 25440 
gccgcacgca ccacgaactg ggtgaatgtc 25500 
acatcgtcga agggcatgta gaccgggcat 25560 
gctgacccga tcggcagctg gcagtggcgg 25620 
ggccgctgcc cgccaaatag gcgggcgaag 25680 
gtcagcaaca ccgtggactg cggggccatc 25740 
tggtccgcgc gcgccgacca ccatacatcc 25800 
taggcatcgc gccccttgat catcgaccat 25860 
tggtcgtagt catcgaagct gcggccacac 25920 
tcggcaatgc gcgtcgcgga cgccaccaaa 25980 
cggcgctgcg ccgatttgcg ggtgcgttcc 26040 
cggttcgccg gcgccgggtc tgtcccgatc 26100 
gctgcggtat tgccgacgcg gtagccggct 26160 
tgcaccgaca ggagctcctc caccagtccg 26220 
ccgtcgacga tgaccgtcgg cgtgaatggt 26280
aactcgcctt gccagcgcac cgcaacgtga 26340 
ggttctgacg aggaatccgg gggccgtcgg 26400
acccagccgg tgatccggcg gccgtagaaa 26460 
gccagcgcaa tccccgccca ccagtagcgc 26520 
gccaacgcgg aggcaagcaa ggcgtgcccg 26580 
atcggcggct cagcgcccgt ctagccagcg 26640 
cggccaccaa cgccacagcc gtaatcgggc 26700 
gtggaagtcg tctgacgttg tatggcgccg 26760 
gcgcggccac cgcatcgatg acgccggcgc 26820 
gtctcgcggt ggcggtgatc cggtggatga 26880 
gccgaagcag ggccgccaga cccgacacat 26940 
gtaccggccc ctcccggcct tgcagcgcat 27000 
cgatgttttc tgcgggcgcg gccacgtcca 27060 
gcatcccggt ctggccgata ccgccgacgc 27120 
tgacaacggt ctgcacattg ttccagccgc 27180 
ccggattctg tacgcaatcg ccaccggtgt 27240 
tgacgttgac cgcatagtcg atggatgcac 27300 
ccttgtagca ggcggcttca ctgatgttga 27360
ccacggcgcg ggcaagactg cggatggaac 27420
ggttggcttg tgagccgacc ggttcgaagg 27480
gagcgtcggg cgcgacgccg acgaacccgt 27540 
atgctgtgag agtcccatgg gcatcacagt 27600 
aatcgccgcc aggttccgcc gggacccgtg 27660 
gcgaagcgtc gacaccggtg tcgatcaccg ccaccgtcac cccggccccg gtcgcgaact 27720
tgtgggcatc ggccacgccc agatacgtgt tgctccacgg cggatcgtgg aacccggacc 27780 
ccggcagcgt ggtgggcgac gcgcacaaaa cgcgctgttc ggtaggctga tccgggcccg 27840 
ccacgtcggg cggcaacgcg cccggatcga tcggcggtgg cgtgatggcc gatgcgggcg 27900
acgcggtgag caacgccagc gccaccgtga tcagaaagat acggtgcact cccagaacac 27960 
tccattcgtt gagattcatt gcgattcatt gagctgcgtt gctaccttgg gccacttgac 28020 
ggacctgtgt gcattttaga cgtaacggct gggcaaacaa cgctgtcacg cctgggctgg 28080 
tccgccgcgc cgaccagggc gcgtaggcgc tgtacctgga ccacgccggg actcaacggt 28140
tttgctaccg cactagccga tatgcggctg ctaccaaacg atcgcggcca tgtctcggtt 28200
gtctgagcac acgctgcgta tcgcggcatc gatgtcggtg gcggtgatga tctgcagatc 28260 
ctgaaccgat accggttggc ccgcacgttt ttgcgcaacc acccgggtgt cccggaaccc 28320 
ttcggcgcgt tcgatcacgt tgcgggcgaa ccgaccgttt tgcatagcgt cgataccgtg 28380 
ctgcccacta ggggtggtgt agttacggat ggtggtgacc gcgtcgagga atacctcccg 28440 
tgcggcgtca tcgagctggc tggcgcgcgg tgtagcgtag cggtgtccaa tctcgacgat 28500 
ctccaccggc gaataagact cgaaccgcag ctttcggttg aaccggccag ccaaacccgg 28560 
gttcacggtg aggaattcat ccacctgatc ctcatagccg gccccgatga aacagaagtc 28620 
gaatcggtgt gtttccaatt gaaccaggag ttgattgacc gcctccatgc cgatcatgtc 28680 
cggtgttccg tcttgatgac gttcgatcag cgagtagaac tcgtccatga aaatgattcg 28740
cccgagtgac ttttcgatca gctcgttcgt cttgggtcct gactccccga tgtagtgccc 28800 
acagaagtcc gatcggcgaa cttctcgaat ttcggggtga cgcacgatcc ccatgccggc 28860
gtagatcttg ccgagcgctt cagcggtggt tgtcttacct gtgcctggtg gccccaccag 28920 
caacatgtgg ttggtctgcc cctccaccgg taggccgtgc tctaggcgca tcatgcgcac 28980
ctcgagttgg tcttccagcg ccgataccgc ttgcttgacc gccgccaggc ccacctgttt 29040 
ggccagcagt tcccggccct cggctagcag ctcgccgcgc cgctgcgctg cattgtcgtc 29100 
atcgagctgg tcgcggcttt tcgccgtcga agcatcccaa cggtcggagc ggctggcgat 29160 
ggttcgttca tcggtaacaa tcaagcgcag gttcgggtcc gccagggctt ctttggcggc 29220
gtcggtgagc accccgttga tggtggcctt cgacagccag atctgggcct tgtcctcctc 29280
atgcagttgc cggtacacca tcccccgcac atacgccaag tcggcgacca gcagcggaat 29340
atcggccggt ccgatcgccg cggtgagcac gtcggcgccg aaccgctccg atgacctgct 29400
gtgtccgatc acgtccaccc ggtccagcca gtccagggcc actcgcccct gcccgagatg 29460
ggcggcggcg tgggctgcca gcgcacaaat cgacgcggtc accgccggca tgacgatcgc 29520
ctgtggcggc agatcctcgg cggccgtcga caacacgtcg ggccatcgct gcgtgacgta 29580
catcaggaac gcccgagcca gctgatgcca ctggtagttg cgccacgaat ccaatagctc 29640 
gcggtttgct aacagggcat cggccttcgc atactccccc gcgatcgtca acgccgacga 29700
cagcgccagc cccacctgag atgcgtcggt caccgtgatc ccgatggatg gtcccagctg 29760 
gacctcagcg gccaacgtcc ggccgatccg cgtggtctcg cggtgcagcc actcgctatg 29820 
ggcgttgagc tgcttaagcg aggccagatc gcggtcaccg caggcgatac gacccagcca 29880
cgcgtcggcc atcgacggat cggcctcggt ggcagccaca aactcaggca acgccgccac 29940
gcatccctgg ccattcttga tcgtcatcgc ccgatcgaaa tgccggcgcg cagtgagtaa 30000 
atcacccatc gtgtccacca ttctcgacat cgccgccgct gtcaccgcgg ttgcaacgtg 30060 
tgtctgtcac tctgtgcctc aaattccgtt ggcaacgttc taccggccta tcgacatcgt 30120
gaccggctca aggctgacat agcggttctc cgcacggaac atttccatct caaccagcca 30180 
gttttgtcct gccgcaccga ctttcaccgt tgcccgatcg atttgttcga tggtcacctc 30240 
gaagccatgc cgatcgctct cggacagcga ggtaccgggt cgggcaatgg tgatgacact 30300 
ggctggccgt ggcgtgggcg aaatcgcgac atcgacaccg ctgccttcag atttgccgtc 30360 
atcgccgttc ttgcgccgcc gcacgtactc cacgacgccg acagtggtgc gcggcgcggg 30420
ccgtggtgtg ccgacgatgc tcaactgcgg catgcgtacg ctggcccaac gctcttggtc 30480 
gcgagtgtgc acacacaccc gctcaccggc accgacgacg cgaatcacga tcctcttggc 30540 
gatcgtgtcg tccgcggcca cgaagacgcg cgacagctca ccggcgtcgg taacgggaat 30600 
catcagccgg tccccgttgc tcagcttgcc aatcaacacc cccgacggtc cgatctcggt 30660 
gactagctgc gccggcaacg ggcagcgccg ctgtccgcgt aggtgtggac gtggcccgca 30720 
catgttggcc gcagccgcgg cggcttgctc accattgagc cgacgcaaga tcacactggg 30780 
cggggtaggc gccggcgtcg gtgtgcgcac ggtgatggtc gcggtgcacg tcgcgtccgg 30840 
atacaccgtt acgttctgga tgacctcatc ggcacgcagc gtccaggctt gcgagagaac 30900
ccgcgacgaa atcgcctcag ccgggtacgc atacgtcgtc atccacccgg cttcaccgcg 30960 
gatagctttc cagcgctgcg cactcccggc taccgcgtcc gaccccagcc ggcgatcaag 31020 
ctcagccaag tctgttgcgg tggccagttt ggcgcgcaag ccctgacagc gcagggagct 31080
ggcaacgcgt tgggcgaccg aaatggcagc ggccccaacg ctggtacgcc agcgtaaagc 31140 
ttgggtgttg ccgatcaccg gaagccgcat gatcagccac gtttcgcgcc gcccggcata 31200 
cggcggcgta ccgatctccg cgtcatacac ccgcgggtaa tcgccgacgg tgccggttcg 31260 
cgagccgaag gtgacgacgc tgattgaatc gagttccagg tccagcgggt ggcgcagcaa 31320 
cggcgcgagc tcaacgacgt caatcacgtt gtcgctttct acggtcaccg ^^^^99^gac 31380

citagtcgcc cggtgcgctc ggccgagaag ttgcaccgcc accaccgcga "C^Stcttg 31440 

cicglggacg cllcccccgg atcggttgtt ggccaaggta attgggtcat tccatttgac 31500 

ggglclccgl ccocgcagcc ccagtaccgc ccacgaccac gccggctgac cccaccactg 31560 

?Icgalcacc aaggcgacgc cgaccacgac agccatgacc gcacctagct sg^cgcccag 31620 

cgcccagccc gccgacgcga gcacgaacac tgtccacacc ccggcgaccc S^^tcgcact 31680 

gcgcgggctg aacccggtca gcttggacgt caacgcgccc tccgtagccg ^Scoacs^ll 

Ica^lllocl gcacaccggt ggccactgcg ccgacgaacc cgatagcgat attgcgcgcc 31800 
cggtgatc 



<210> 2
<211> 13773

<212> DNA

<213> Mycobacterium tuberculosis

<223> Complete DNA sequence of RD1 Rv3867-3877

aJgg^ggacc cgccgggcaa cgacgacgac cacggtgatc tcgacgccct cgatttctcc 60 
SScccaca ccaacgaggc gtcgccgctg gacgccttag acgactatgc gccggtgcag 120 
ISatgacg ccgaaSg!|a cctggacgcc ctccatgcgc tcaccgaacg cgacgaggag 180 
ccaqaqctgg agttgttcac ggtgaccaac cctcaagggt cggtgtcggt ctcaaccctg 240 
atlScggcS glatccagca c|tcgagctg acggacaagg cgaccagcat gtccgaagcg 300
cStglccg Icgagatctt cgttattgcc gatctggccc gccaaaaggc gcgggcgtcg 360
cagtacScg? tcltlgtgga gaacatcggt gaactgaccg acgaagacgc agaaggcagc 420
gccctgctic gggaS?t!|t ggggatgacc ctgaatctgc cgacgccgga agaggctgcc 480
ocagccgaag S|aagtgtt cgccacccgc tacgatgtcg actacacctc ccggtacaag 540
IcSatSc? ga?cg?t?gg ccagtctgtt cgaaagcgcc gtcagcatgt tgccgatgtc 600
Igalgclcgg IcgcLgatc tgttcaccga gatcaccaac tacgacgaat ccgcttgcga 660
cSSat? ggccggatcc ggtgtgggga caccgaccgg gtgacgctgt ttcgcgcctg 720
SStcgcgc cIcaSf ttcg Scagttgtc gggatcggtc cagatctcga tgagcacgtt 780
laacgccSg attgccatcl Sggggctgta cggcgatatc acctacccgg tcacctcgcc 840
lllllcllll Iccltgggc? Sgccgcatg cgaggcagcg caaggcaatt acgccgacgc 900
lllllllTcc Jtagaigccg ccccggtcgc gggttccgag cacctggtgg cgtggatgaa 960
ggcg|t?|tc tac|gScg| ccgaacgctg gaccgacgtg atcgaccagg t-agagtgc 1020
?|ggaa?gg ccgSclagt ttttggccgg cgcggccggt gtggcgcacg ggg"gccgc 1080
aacSaacctg gccttgttca ccgaagccga acgccgactc accgaggcca acgactcgcc 1140
Sccggtgal Icgtglgcgc gcgccatcgc ctggtatctg gcgatggcac ggcgcagcca 1200
ITaclToall Igcgccgclg tggcgctgct ggaatggtta cagaccactc accccgagcc 1260
calagSc? gSglcgctS aSatccctc ctaccggctg aagacgacca ccgcogaaca 1320
aaJcqScc cgc|clga?c cllgggatcc gggcagtgtc gtgaccgaca actccggccg 1380
aaaacqSctg clc|cc|agg cccaagccga actcgaccgc caaattgggc tcacccgggt 1440
?lalaa?ca| att|aa?gc? accgcgcggc gacgctgatg gcccgggtcc gcgccgccaa 1500
gggtaSS gtc|ccclgc ccagcaagca catgatcttc accggaccgc ccggtaccgg 1560

rcISS? c-S-a ct^cgSirc |g

Ifatgcfctg 1^ ^3^0
cacactactg gcgcggatgg agaacgaccg ggaccggctg gtggtgatca ^^9^^=999^^

cStccgic Llgalcgic tgctggaaac caacgagggt ctgcggtcgc 9gttcgccac 1920

tScatcSg ttclacacct a?tcccccga ggaactcctc gagatcgcca ^cgtcattgc 1980

cgctgctS? gat?cggcgt tgaccgcaga ggcggccgag aactttcttc aggccgccaa 2040

QcaStgSg ?agcgcltgt t|cgcggccg gcgcgccctg gacgtcgccg gcaacggtcg 2100

S?gcSgc calclggtig aggccagcga gcaatgccgg gacatgcgtc tagcccaggt 2160

cctcga?a?c gacaclctSI acgaagaccg gcttcgcgag atcaacggct cagatatggc 2220

aaaoStSc Iccgcggtgc acgcacacct caacatgaga gaatgaacta tggggcttcg 2280
cctcaccacc Sag|t?caig ttlgcggctg gcgttttctg ctgcgccggc tcgaacacgc 2340
catcgtgcgc cgSacacc? ggatgtttga cgacccgctg cagttctaca gccgctcgat 2400
cgctSglc a?c|tcgtcg cggtcctgat tctggcgggt gccgcgctgc tggcgtactt 2460
caaaccacaa ggcaaactcg gcggcaccag cctgttcacc gaccgcgcga ccaaccagct 2520

ttacgtgctg ctgtccggac agttgcatcc ggtctacaac ctgacttcgg cgcggctggt 2580 

gctgggcaat ccggccaacc cggccaccgt gaagtcctcc gaactgagca agctgccgat 2640 

gggccagacc gttggaatcc ccggcgcccc ctacgccacg cctgtttcgg cgggcagcac 2700

ctcgatctgg accctatgcg acaccgtcgc ccgagccgac tccacttccc cggtagtgca 2760 

gaccgcggtc atcgcgatgc cgttggagat cgatgcttcg atcgatccgc tccagtcaca 2820

cgaagcggtg ctggtgtcct accagggcga aacctggatc gtcacaacta agggacgcca 2880 

cgccatagat ctgaccgacc gcgccctcac ctcgtcgatg gggataccgg tgacggccag 2940

gccaaccccg atctcggagg gcatgttcaa cgcgctgcct gatatggggc cctggcagct 3000

gccgccgata ccggcggcgg gcgcgcccaa ttcgcttggc ctacctgatg atctagtgat 3060

cggatcggtc ttccagatcc acaccgacaa gggcccgcaa tactatgtgg tgctgcccga 3120 

cggcatcgcg caggtcaacg cgacaaccgc tgcggcgctg cgcgccaccc aggcgcacgg 3180 

gctggtcgcg ccaccggcaa tggtgcccag tctggtcgtc agaatcgccg aacgggtata 3240 

cccctcaccg ctacccgatg aaccgctcaa gatcgtgtcc cggccgcagg atcccgcgct 3300 

gtgctggtca tggcaacgca gcgccggcga ccagtcgccg cagtcaacgg tgctgtccgg 3360 

ccggcatctg ccgatatcgc cctcagcgat gaacatgggg atcaagcaga tccacgggac 3420 

ggcgaccgtt tacctcgacg gcggaaaatt cgtggcactg caatcccccg atcctcgata 3480 

caccgaatcg atgtactaca tcgatccaca gggcgtgcgt tatggggtgc ctaacgcgga 3540 

gacagccaag tcgctgggcc tgagttcacc ccaaaacgcg ccctgggaga tcgttcgtct 3600

cctggtcgac ggtccggtgc tgtcgaaaga tgccgcactg ctcgagcacg acacgctgcc 3660

cgctgaccct agcccccgaa aagttcccgc cggagcctcc ggagccccct gatgacgacc 3720

aagaagttca ctcccaccat tacccgtggc ccccggttga ccccgggcga gatcagcctc 3780 

acgccgcccg atgacctggg catcgacatc ccaccgtcgg gcgtccaaaa gatccttccc 3840

tacgtgatgg gtggcgccat gctcggcatg atcgccatca tggtggccgg cggcaccagg 3900

cagctgtcgc cgtacatgtt gatgatgccg ctgatgatga tcgtgatgat ggtcggcggt 3960

ctggccggta gcaccggtgg tggcggcaag aaggtgcccg aaatcaacgc cgaccgcaag 4020

gagtacctgc ggtatttggc aggactacgc acccgagtga cgtcctcggc cacctctcag 4080 

gtggcgttct tctcctacca cgcaccgcat cccgaggatc tgttgtcgat cgtcggcacc 4140

caacggcagt ggtcccggcc ggccaacgcc gacttctatg cggccacccg aatcggtatc 4200 

ggtgaccagc cggcggtgga tcgattattg aagccggccg tcggcgggga gttggccgcc 4260 

gccagcgcag cacctcagcc gttcctggag ccggtcagtc atatgtgggt ggtcaagttt 4320 

ctacgaaccc atggattgat ccatgactgc ccgaaactgc tgcaactccg tacctttccg 4380 

actatcgcga tcggcgggga cttggcgggg gcagccggcc tgatgacggc gatgatctgt 4440 

cacctagccg tgttccaccc accggacctg ctgcagatcc gggtgctcac cgaggaaccc 4500 

gacgaccccg actggtcctg gctcaaatgg cttccgcacg tacagcacca gaccgaaacc 4560 

gatgcggccg ggtccacccg gctgatcttc acgcgccagg aaggtctgtc ggacctggcc 4620

gcgcgcgggc cacacgcacc cgattcgctt cccggcggcc cctacgtagt cgtcgtcgac 4680

ctgaccggcg gcaaggctgg attcccgccc gacggtaggg ccggtgtcac ggtgatcacg 4740 

ttgggcaacc atcgcggctc ggcctaccgc atcagggtgc acgaggatgg gacggctgat 4800

gaccggctcc ctaaccaatc gtttcgccag gtgacatcgg tcaccgatcg gatgtcgccg 4860

cagcaagcca gccgtatcgc gcgaaagttg gccggatggt ccatcacggg caccatcctc 4920

gacaagacgt cgcgggtcca gaagaaggtg gccaccgact ggcaccagct ggtcggtgcg 4980

caaagtgtcg aggagataac accttcccgc tggaggatgt acaccgacac cgaccgtgac 5040

cggctaaaga tcccgtttgg tcatgaacta aagaccggca acgtcatgta cctggacatc 5100 

aaagagggcg cggaattcgg cgccggaccg cacggcatgc tcatcgggac cacggggtct 5160 

gggaagtccg aattcctgcg caccctgatc ctgtcgctgg tggcaatgac tcatccagat 5220 

caggtgaatc tcctgctcac cgacttcaaa ggtggttcaa ccttcctggg aatggaaaag 5280 

cttccgcaca ctgccgctgt cgtcaccaac atggccgagg aagccgagct cgtcagccgg 5340 

atgggcgagg tgttgaccgg agaactcgat cggcgccagt cgatcctccg acaggccggg 5400 

atgaaagtcg gcgcggccgg agccctgtcc ggcgtggccg aatacgagaa gtaccgcgaa 5460 

cgcggtgccg acctaccccc gctgccaacg cttttcgtcg tcgtcgacga gttcgccgag 5520 

ctgttgcaga gtcacccgga cttcatcggg ctgttcgacc ggatctgccg cgtcgggcgg 5580

tcgctgaggg tccatctgct gctggctacc cagtcgctgc agaccggcgg tgttcgcatc 5640

gacaaactgg agccaaacct gacatatcga atcgcattgc gcaccaccag ctctcatgaa 5700 

tccaaggcgg taatcggcac accggaggcg cagtacatca ccaacaagga gagcggtgtc 5760 

gggtttctcc gggtcggcat ggaagacccg gtcaagttca gcaccttcta catcagtggg 5820

ccatacatgc cgccggcggc aggcgtcgaa accaatggtg aagccggagg gcccggtcaa 5880

cagaccacta gacaagccgc gcgcattcac aggttcaccg cggcaccggt tctcgaggag 5940

gcgccgacac cgtgacccgc gccggcgacg atgcaaagcg cagcgatgag gaggagcggc 6000

gccaacggcc cgcgccggcg acgatgcaaa gcgcagcgat gaggaggagc ggcgcgcatg 6060 

actgctgaac cggaagtacg gacgctgcgc gaggttgtgc tggaccagct cggcactgct 6120 
gaatcgcgtg cgtacaagat gtggctgccg ccgttgacca atccggtccc gctcaacgag 6180

ctcatcgccc gtgatcggcg acaacccctg cgatttgccc tggggatcat ggatgaaccg 6240 

cgccgccatc tacaggatgt gtggggcgta gacgtttccg gggccggcgg caacatcggt 6300

attgggggcg cacctcaaac cgggaagtcg acgctactgc agacgatggt gatgtcggcc 6360

gccgccacac actcaccgcg caacgttcag ttctattgca tcgacctagg tggcggcggg 6420 

ctgatctatc tcgaaaacct tccacacgtc ggtggggtag ccaatcggtc cgagcccgac 6480

aaggtcaacc gggtggtcgc agagatgcaa gccgtcatgc ggcaacggga aaccaccttc 6540 

aaggaacacc gagtgggctc gatcgggatg taccggcagc tgcgtgacga tccaagtcaa 6600 

cccgttgcgt ccgatccata cggcgacgtc tttctgatca tcgacggatg gcccggtttt 6660 

gtcggcgagt tccccgacct tgaggggcag gttcaagatc tggccgccca ggggctggcg 6720 

ttcggcgtcc acgtcatcat ctccacgcca cgctggacag agctgaagtc gcgtgttcgc 6780

gactacctcg gcaccaagat cgagttccgg cttggtgacg tcaatgaaac ccagatcgac 6840 

cggattaccc gcgagatccc ggcgaatcgt ccgggtcggg cagtgtcgat ggaaaagcac 6900 

catctgatga tcggcgtgcc caggttcgac ggcgtgcaca gcgccgataa cctggtggag 6960 

gcgatcaccg cgggggtgac gcagatcgct tcccagcaca ccgaacaggc acctccggtg 7020

cgggtcctgc cggagcgtat ccacctgcac gaactcgacc cgaacccgcc gggaccagag 7080

tccgactacc gcactcgctg ggagattccg atcggcttgc gcgagacgga cctgacgccg 7140 

gctcactgcc acatgcacac gaacccgcac ctactgatct tcggtgcggc caaatcgggc 7200 

aagacgacca ttgcccacgc gatcgcgcgc gccatttgtg cccgaaacag tccccagcag 7260 

gtgcggttca tgctcgcgga ctaccgctcg ggcctgctgg acgcggtgcc ggacacccat 7320 

ctgctgggcg ccggcgcgat caaccgcaac agcgcgtcgc tagacgaggc cgttcaagca 7380 

ctggcggtca acctgaagaa gcggttgccg ccgaccgacc tgacgacggc gcagctacgc 7440 

tcgcgttcgt ggtggagcgg atttgacgtc gtgcttctgg tcgacgattg gcacatgatc 7500 

gtgggtgccg ccggggggat gccgccgatg gcaccgctgg ccccgttatt gccggcggcg 7560 

gcagatatcg ggttgcacat cattgtcacc tgtcagatga gccaggctta caaggcaacc 7620 

atggacaagt tcgtcggcgc cgcattcggg tcgggcgctc cgacaatgtt cctttcgggc 7680

gagaagcagg aattcccatc cagtgagttc aaggtcaagc ggcgcccccc tggccaggca 7740 

tttctcgtct cgccagacgg caaagaggtc atccaggccc cctacatcga gcctccagaa 7800 

gaagtgttcg cagcaccccc aagcgccggt taagattatt tcattgccgg tgtagcagga 7860

cccgagctca gcccggtaat cgagttcggg caatgctgac catcgggttt gtttccggct 7920

ataaccgaac ggtttgtgta cgggatacaa atacagggag ggaagaagta ggcaaatgga 7980

aaaaatgtca catgatccga tcgctgccga cattggcacg caagtgagcg acaacgctct 8040

gcacggcgtg acggccggct cgacggcgct gacgtcggtg accgggctgg ttcccgcggg 8100 

ggccgatgag gtctccgccc aagcggcgac ggcgttcaca tcggagggca tccaattgct 8160 

ggcttccaat gcatcggccc aagaccagct ccaccgtgcg ggcgaagcgg tccaggacgt 8220 

cgcccgcacc tattcgcaaa tcgacgacgg cgccgccggc gtcttcgccg aataggcccc 8280

caacacatcg gagggagtga tcaccatgct gtggcacgca atgccaccgg agctaaatac 8340

cgcacggctg atggccggcg cgggtccggc tccaatgctt gcggcggccg cgggatggca 8400 

gacgctttcg gcggctctgg acgctcaggc cgtcgagttg accgcgcgcc tgaactctct 8460 

gggagaagcc tggactggag gtggcagcga caaggcgctt gcggctgcaa cgccgatggt 8520 

ggtctggcta caaaccgcgt caacacaggc caagacccgt gcgatgcagg cgacggcgca 8580
agccgcggca tacacccagg ccatggccac gacgccgtcg ctgccggaga tcgccgccaa 8640
ccacatcacc caggccgtcc ttacggccac caacttcttc ggtatcaaca cgatcccgat 8700
cgcgttgacc gagatggatt atttcatccg tatgtggaac caggcagccc tggcaatgga 8760
ggtctaccag gccgagaccg cggttaacac gcttttcgag aagctcgagc cgatggcgtc 8820
gatccttgat cccggcgcga gccagagcac gacgaacccg atcttcggaa tgccctcccc 8880
tggcagctca acaccggttg gccagttgcc gccggcggct acccagaccc tcggccaact 8940
gggtgagatg agcggcccga tgcagcagct gacccagccg ctgcagcagg tgacgtcgtt 9000
gttcagccag gtgggcggca ccggcggcgg caacccagcc gacgaggaag ccgcgcagat 9060

gggcctgctc ggcaccagtc cgctgtcgaa ccatccgctg gctggtggat caggccccag 9120 

cgcgggcgcg ggcctgctgc gcgcggagtc gctacctggc gcaggtgggt cgttgacccg 9180 
cacgccgctg atgtctcagc tgatcgaaaa gccggttgcc ccctcggtga tgccggcggc 9240
tgctgccgga tcgtcggcga cgggtggcgc cgctccggtg ggtgcgggag cgatgggcca 9300
gggtgcgcaa tccggcggct ccaccaggcc gggtctggtc gcgccggcac cgctcgcgca 9360
ggagcgtgaa gaagacgacg aggacgactg ggacgaagag gacgactggt gagctcccgt 9420

aatgacaaca gacttcccgg ccacccgggc cggaagactt gccaacattt tggcgaggaa 9480 

ggtaaagaga gaaagtagtc cagcatggca gagatgaaga ccgatgccgc taccctcgcg 9540 
caggaggcag gtaatttcga gcggatctcc ggcgacctga aaacccagat cgaccaggtg 9600
gagtcgacgg caggttcgtt gcagggccag tggcgcggcg cggcggggac ggccgcccag 9660
gccgcggtgg tgcgcttcca agaagcagcc aataagcaga agcaggaact cgacgagatc 9720
tcgacgaata ttcgtcaggc cggcgtccaa tactcgaggg ccgacgagga gcagcagcag 9780
gcgctgtcct cgcaaatggg cttctgaccc gctaatacga aaagaaacgg agcaaaaaca 9840
tgacagagca gcagtggaat ttcgcgggta tcgaggccgc ggcaagcgca atccagggaa 9900
atgtcacgtc cattcattcc ctccttgacg aggggaagca gtccctgacc aagctcgcag 9960
cggcctgggg cggtagcggt tcggaggcgt accagggtgt ccagcaaaaa tgggacgcca 10020
cggctaccga gctgaacaac gcgctgcaga acctggcgcg gacgatcagc gaagccggtc 10080
aggcaatggc ttcgaccgaa ggcaacgtca ctgggatgtt cgcatagggc aacgccgagt 10140
tcgcgtagaa tagcgaaaca cgggatcggg cgagttcgac cttccgtcgg tctcgccctt 10200
tctcgtgttt atacgtttga gcgcactctg agaggttgtc atggcggccg actacgacaa 10260
gctcttccgg ccgcacgaag gtatggaagc tccggacgat atggcagcgc agccgttctt 10320
cgaccccagt gcttcgtttc cgccggcgcc cgcatcggca aacctaccga agcccaacgg 10380
ccagactccg cccccgacgt ccgacgacct gtcggagcgg ttcgtgtcgg ccccgccgcc 10440
gccaccccca cccccacctc cgcctccgcc aactccgatg ccgatcgccg caggagagcc 10500
gccctcgccg gaaccggccg catctaaacc acccacaccc cccatgccca tcgccggacc 10560
cgaaccggcc ccacccaaac cacccacacc ccccatgccc atcgccggac ccgaaccggc 10620
cccacccaaa ccacccacac ctccgatgcc catcgccgga cctgcaccca ccccaaccga 10680
atcccagttg gcgcccccca gaccaccgac accacaaacg ccaaccggag cgccgcagca 10740
accggaatca ccggcgcccc acgtaccctc gcacgggcca catcaacccc ggcgcaccgc 10800
accagcaccg ccctgggcaa agatgccaat cggcgaaccc ccgcccgctc cgtccagacc 10860
gtctgcgtcc ccggccgaac caccgacccg gcctgccccc caacactccc gacgtgcgcg 10920
ccggggtcac cgctatcgca cagacaccga acgaaacgtc gggaaggtag caactggtcc 10980
atccatccag gcgcggctgc gggcagagga agcatccggc gcgcagctcg cccccggaac 11040
ggagccctcg ccagcgccgt tgggccaacc gagatcgtat ctggctccgc ccacccgccc 11100
cgcgccgaca gaacctcccc ccagcccctc gccgcagcgc aactccggtc ggcgtgccga 11160
gcgacgcgtc caccccgatt tagccgccca acatgccgcg gcgcaacctg attcaattac 11220
ggccgcaacc actggcggtc gtcgccgcaa gcgtgcagcg ccggatctcg acgcgacaca 11280
gaaatcctta aggccggcgg ccaaggggcc gaaggtgaag aaggtgaagc cccagaaacc 11340
gaaggccacg aagccgccca aagtggtgtc gcagcgcggc tggcgacatt gggtgcatgc 11400
gttgacgcga atcaacctgg gcctgtcacc cgacgagaag tacgagctgg acctgcacgc 11460
tcgagtccgc cgcaatcccc gcgggtcgta tcagatcgcc gtcgtcggtc tcaaaggtgg 11520
ggctggcaaa accacgctga cagcagcgtt ggggtcgacg ttggctcagg tgcgggccga 11580
ccggatcctg gctctagacg cggatccagg cgccggaaac ctcgccgatc gggtagggcg 11640
acaatcgggc gcgaccatcg ctgatgtgct tgcagaaaaa gagctgtcgc actacaacga 11700
catccgcgca cacactagcg tcaatgcggt caatctggaa gtgctgccgg caccggaata 11760
cagctcggcg cagcgcgcgc tcagcgacgc cgactggcat ttcatcgccg atcctgcgtc 11820
gaggttttac aacctcgtct tggctgattg tggggccggc ttcttcgacc cgctgacccg 11880
cggcgtgctg tccacggtgt ccggtgtcgt ggtcgtggca agtgtctcaa tcgacggcgc 11940
acaacaggcg tcggtcgcgt tggactggtt gcgcaacaac ggttaccaag atttggcgag 12000
cpgcgcatgc gtggtcatca atcacatcat gccgggagaa cccaatgtcg cagttaaaiga 12060
cctggtgcgg catttcgaac agcaagttca acccggccgg gtcgtggtca tgccgtggga 12120
caggcacatt gcggccggaa ccgagatttc actcgacttg ctcgacccta tctacaagcg 12180
caaggtcctc gaattggccg cagcgctatc cgacgatttc gagagggctg gacgtcgttg 12240
agcgcacctg ctgttgctgc tggtcctacc gccgcggggg caaccgctgc gcggcctgcc 12300
accacccggg tgacgatcct gaccggcaga cggatgaccg atttggtact gccagcggcg 12360
gtgccgatgg aaacttatat tgacgacacc gtcgcggtgc tttccgaggt gttggaagac 12420
acgccggctg atgtactcgg cggcttcgac tttaccgcgc aaggcgtgtg ggcgttcgct 12480
cgtcccggat cgccgccgct gaagctcgac cagtcactcg atgacgccgg ggtggtcgac 12540
gggtcactgc tgactctggt gtcagtcagt cgcaccgagc gctaccgacc gttggtcgag 12600
gatgtcatcg acgcgatcgc cgtgcttgac gagtcacctg agttcgaccg cacggcattg 12660
aatcgctttg tgggggcggc gatcccgctt ttgaccgcgc ccgtcatcgg gatggcgatg 12720
cgggcgtggt gggaaactgg gcgtagcttg tggtggccgt tggcgattgg catcctgggg 12780
atcgctgtgc tggtaggcag cttcgtcgcg aacaggttct accagagcgg ccacctggcc 12840
gagtgcctac tggtcacgac gtatctgctg atcgcaaccg ccgcagcgct ggccgtgccg 12900
ttgccgcgcg gggtcaactc gttgggggcg ccacaagttg ccggcgccgc tacggccgtg 12960
ctgtttttga ccttgatgac gcggggcggc cctcggaagc gtcatgagtt ggcgtcgttt 13020
gccgtgatca ccgctatcgc ggtcatcgcg gccgccgctg ccttcggcta tggataccag 13080
gactgggtcc ccgcgggggg gatcgcattc gggctgttca ttgtgacgaa tgcggccaag 13140
ctgaccgtcg cggtcgcgcg gatcgcgctg ccgccgattc cggtacccgg cgaaaccgtg 13200
gacaacgagg agttgctcga tcccgtcgcg accccggagg ctaccagcga agaaaccccg 13260
acctggcagg ccatcatcgc gtcggtgccc gcgtccgcgg tccggctcac cgagcgcagc 13320
aaactggcca agcaacttct gatcggatac gtcacgtcgg gcaccctgat tctggctgcc 13380
ggtgccatcg cggtcgtggt gcgcgggcac ttctttgtac acagcctggt ggtcgcgggt 13440
ttgatcacga ccgtctgcgg atttcgctcg cggctttacg ccgagcgctg gtgtgcgtgg 13500
gcgttgctgg cggcgacggt cgcgattccg acgggtctga cggccaaact catcatctgg 13560
tacccgcact atgcctggct gttgttgagc gtctacctca cggtagccct ggttgcgctc 13620
gtggtggtcg ggtcgatggc tcacgtccgg cgcgtttcac cggtcgtaaa acgaactctg 13680
gaattgatcg acggcgccat gatcgctgcc atcattccca tgctgctgtg gatcaccggg 13740
gtgtacgaca cggtccgcaa tatccggttc tga 13773



<210> 3

<211> 3909

<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> RD1-AP34 (a 3909 bp fragment of the M. tuberculosis H37Rv genome)



<400> 3 

gaattcccat ccagtgagtt caaggtcaag cggcgccccc ctggccaggc atttctcgtc 60
tcgccagacg gcaaagaggt catccaggcc ccctacatcg agcctccaga agaagtgttc 120
gcagcacccc caagcgccgg ttaagattat ttcattgccg gtgtagcagg acccgagctc 180
agcccggtaa tcgagttcgg gcaatgctga ccatcgggtt tgtttccggc tataaccgaa 240
cggtttgtgt acgggataca aatacaggga gggaagaagt aggcaaatgg aaaaaatgtc 300
acatgatccg atcgctgccg acattggcac gcaagtgagc gacaacgctc tgcacggcgt 360
gacggccggc tcgacggcgc tgacgtcggt gaccgggctg gttcccgcgg gggccgatga 420
ggtctccgcc caagcggcga cggcgttcac atcggagggc atccaattgc tggcttccaa 480
tgcatcggcc caagaccagc tccaccgtgc gggcgaagcg gtccaggacg tcgcccgcac 540
ctattcgcaa atcgacgacg gcgccgccgg cgtcttcgcc gaataggccc ccaacacatc 600
ggagggagtg atcaccatgc tgtggcacgc aatgccaccg gagctaaata ccgcacggct 660
gatggccggc gcgggtccgg ctccaatgct tgcggcggcc gcgggatggc agacgctttc 720
ggcggctctg gacgctcagg ccgtcgagtt gaccgcgcgc ctgaactctc tgggagaagc 780
ctggactgga ggtggcagcg acaaggcgct tgcggctgca acgccgatgg tggtctggct 840
acaaaccgcg tcaacacagg ccaagacccg tgcgatgcag gcgacggcgc aagccgcggc 900
atacacccag gccatggcca cgacgccgtc gctgccggag atcgccgcca accacatcac 960
ccaggccgtc cttacggcca ccaacttctt cggtatcaac acgatcccga tcgcgttgac 1020
cgagatggat tatttcatcc gtatgtggaa ccaggcagcc ctggcaatgg aggtctacca 1080
ggccgagacc gcggttaaca cgcttttcga gaagctcgag ccgatggcgt cgatccttga 1140
tcccggcgcg agccagagca cgacgaaccc gatcttcgga atgccctccc ctggcagctc 1200
aacaccggtt ggccagttgc cgccggcggc tacccagacc ctcggccaac tgggtgagat 1260
gagcggcccg atgcagcagc tgacccagcc gctgcagcag gtgacgtcgt tgttcagcca 1320
ggtgggcggc accggcggcg gcaacccagc cgacgaggaa gccgcgcaga tgggcctgct 1380
cggcaccagt ccgctgtcga accatccgct ggctggtgga tcaggcccca gcgcgggcgc 1440
gggcctgctg cgcgcggagt cgctacctgg cgcaggtggg tcgttgaccc gcacgccgct 1500
gatgtctcag ctgatcgaaa agccggttgc cccctcggtg atgccggcgg ctgctgccgg 1560
atcgtcggcg acgggtggcg ccgctccggt gggtgcggga gcgatgggcc agggtgcgca 1620
atccggcggc tccaccaggc cgggtctggt cgcgccggca ccgctcgcgc aggagcgtga 1680
agaagacgac gaggacgact gggacgaaga ggacgactgg tgagctcccg taatgacaac 1740
agacttcccg gccacccggg ccggaagact tgccaacatt ttggcgagga aggtaaagag 1800
agaaagtagt ccagcatggc agagatgaag accgatgccg ctaccctcgc gcaggaggca 1860
ggtaatttcg agcggatctc cggcgacctg aaaacccaga tcgaccaggt ggagtcgacg 1920
gcaggttcgt tgcagggcca gtggcgcggc gcggcgggga cggccgccca ggccgcggtg 1980
gtgcgcttcc aagaagcagc caataagcag aagcaggaac tcgacgagat ctcgacgaat 2040
attcgtcagg ccggcgtcca atactcgagg gccgacgagg agcagcagca ggcgctgtcc 2100
tcgcaaatgg gcttctgacc cgctaatacg aaaagaaacg gagcaaaaac atgacagagc 2160
agcagtggaa tttcgcgggt atcgaggccg cggcaagcgc aatccaggga aatgtcacgt 2220
ccattcattc cctccttgac gaggggaagc agtccctgac caagctcgca gcggcctggg 2280
gcggtagcgg ttcggaggcg taccagggtg tccagcaaaa atgggacgcc acggctaccg 2340
agctgaacaa cgcgctgcag aacctggcgc ggacgatcag cgaagccggt caggcaatgg 2400
cttcgaccga aggcaacgtc actgggatgt tcgcataggg caacgccgag ttcgcgtaga 2460
atagcgaaac acgggatcgg gcgagttcga ccttccgtcg gtctcgccct ttctcgtgtt 2520
tatacgtttg agcgcactct gagaggttgt catggcggcc gactacgaca agctcttccg 2580
gccgcacgaa ggtatggaag ctccggacga tatggcagcg cagccgttct tcgaccccag 2640
tgcttcgttt ccgccggcgc ccgcatcggc aaacctaccg aagcccaacg gccagactcc 2700
gcccccgacg tccgacgacc tgtcggagcg gttcgtgtcg gccccgccgc cgccaccccc 2760
acccccacct ccgcctccgc caactccgat gccgatcgcc gcaggagagc cgccctcgcc 2820
ggaaccggcc gcatctaaac cacccacacc ccccatgccc atcgccggac ccgaaccggc 2880
cccacccaaa ccacccacac cccccatgcc catcgccgga cccgaaccgg ccccacccaa 2940
accacccaca cctccgatgc ccatcgccgg acctgcaccc accccaaccg aatcccagtt 3000
ggcgcccccc agaccaccga caccacaaac gccaaccgga gcgccgcagc aaccggaatc 3060
accggcgccc cacgtaccct cgcacgggcc acatcaaccc cggcgcaccg caccagcacc 3120
gccctgggca aagatgccaa tcggcgaacc cccgcccgct ccgtccagac cgtctgcgtc 3180
cccggccgaa ccaccgaccc ggcctgcccc ccaacactcc cgacgtgcgc gccggggtca 3240
ccgctatcgc acagacaccg aacgaaacgt cgggaaggta gcaactggtc catccatcca 3300
ggcgcggctg cgggcagagg aagcatccgg cgcgcagctc gcccccggaa cggagccctc 3360
gccagcgccg ttgggccaac cgagatcgta tctggctccg cccacccgcc ccgcgccgac 3420
agaacctccc cccagcccct cgccgcagcg caactccggt cggcgtgccg agcgacgcgt 3480
ccaccccgat ttagccgccc aacatgccgc ggcgcaacct gattcaatta cggccgcaac 3540
cactggcggt cgtcgccgca agcgtgcagc gccggatctc gacgcgacac agaaatcctt 3600
aaggccggcg gccaaggggc cgaaggtgaa gaaggtgaag ccccagaaac cgaaggccac 3660
gaagccgccc aaagtggtgt cgcagcgcgg ctggcgacat tgggtgcatg cgttgacgcg 3720
aatcaacctg ggcctgtcac ccgacgagaa gtacgagctg gacctgcacg ctcgagtccg 3780
ccgcaatccc cgcgggtcgt atcagatcgc cgtcgtcggt ctcaaaggtg gggctggcaa 3840
aaccacgctg acagcagcgt tggggtcgac gttggctcag gtgcgggccg accggatcct 3900
ggctctaga 3909



<210> 4 
<211> 324 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3861 



<400> 4 

gtgacctggc tggctgaccc ggtcggcaac agcaggatcg cccgagcgca ggcctgcaaa 60
acgtcaatct cggcgcccat cgtcgaatcc tggcgggcgc aacgcggcgc gcaatgtgga 120
cagcgcgaga aatcttgtcg atgttctcgc gctgtccaca tccagggcat ctcaccgcca 180
ctgttccgca gacccctcga accagcggtc caggcggcgg ttgcgtcatg ccgattgggc 240
agacacccgg tggtcgcgca ccgggtaacc gttgcgctcg gccagggatc gcagctggcc 300
caacgcgaat gcccgcgccc ggcc 324



<210> 5 
<211> 348 
<212> DNA 

<213> Mycobacterium tuberculosis 



<220> 

<223> DNA sequence Rv3862c-whiB6 



<400> 5 

atgcgatacg ctttcgcggc agaggctaca acctgcaacg ctttctggag gaacgtagac 60
atgacagtaa ccgccctgta tgaggtcccg ctcggcgttt gcacgcaaga tcccgatcgt 120
tggacgacga ctcccgacga cgaggccaag accctgtgcc gggcttgccc gcgccggtgg 180
ctgtgtgcac gcgacgccgt cgagtccgcg ggtgcggaag ggctgtgggc aggggtcgta 240
attcccgaat caggccgggc gcgggcattc gcgttgggcc agctgcgatc cctggccgag 300
cgcaacggtt acccggtgcg cgaccaccgg gtgtctgccc aatcggca 348



<210> 6 
<211> 1176 



<212> DNA

<213> Mycobacterium tuberculosis



<220> 

<223> DNA sequence Rv3863



<400> 6 

atggcgggcg agcggaaagt ctgcccaccg tcccggctag tacccgcgaa taagggatca 60
acgcagatgt ctaaagcagg gtcgactgtc ggaccggcgc cgctggtcgc gtgcagcggc 120
ggcacatcag acgtgattga gccccgtcgc ggtgtcgcga tcattggcca ctcgtgccga 180
gtcggcaccc agatcgacga ttctcgaatc tctcagacac atctgcgagc ggtatccgat 240
gatggacggt ggcggatcgt cggcaacatc ccgagaggta tgttcgtcgg cggacgacgc 300
ggcagctcgg tgaccgtcag cgataagacc ctaatccgat tcggcgatcc ccctggaggc 360
aaggcgttga cgttcgaagt cgtcaggccg tcggattccg ctgcacagca cggccgcgta 420
caaccatcag cggacctgtc ggacgacccg gcgcacaacg ctgcgccggt cgcaccggac 480
cccggcgtgg ttcgcgcagg ggcggccgcg gctgcgcgcc gtcgtgaact tgacatcagc 540
caacgcagct tggcggccga cgggatcatc aacgcgggcg cgctcatcgc gttcgagaaa 600
ggccgtagtt ggccccggga acggacccgg gcaaaactcg aagaagtgct gcagtggccc 660
gctggaacca tcgcgcgaat ccgtcggggc gagcccaccg agcccgcaac aaaccccgac 720
gcgtcccccg gactccggcc tgccgacggc ccggcgtcct tgatcgcgca ggctgtcacc 780
gccgccgtag acggctgcag tctggctatc gcagcgttgc cggcgaccga ggaccccgag 840
ttcaccgaac gtgccgcgcc gatccttgct gatttgcgcc agctcgaggc gattgccgtc 900
caagcaaccc gcatcagccg gattaccccg gaattgatca aggcgttggg cgcggtacgt 960
cgccaccacg acgaattaat gaggctggga gcaaccgccc ctggtgccac actggcgcag 1020
cgcttatatg ccgcacggcg gcgcgcgaac ctttccaccc tggagactgc ccaagcggcc 1080
gscgtcgcag aagaaatgat cgtcggcgcc gaagccgagg aagagttgcc agccgaggcc 1140
accgaagcga tcgaagcact gatccgtcag atcaat 1176



<210> 7 
<211> 1206 
<212> DNA 

<213> Mycobacterium tuberculosis 

<220> 

<223> DNA sequence Rv3864



<400> 7 

atggcatcgg gtagcggtct ttgcaagacg acgagtaact ttatttgggg ccagttactc 60
ttgcttggag agggaatccc cgacccaggc gacattttca acaccggttc gtcgctgttc 120
aaacaaatca gcgacaaaat gggactcgcc attccgggca ccaactggat cggccaagcg 180
gcggaagctt acctaaacca gaacatcgcg caacaacttc gcgcacaggt gatgggcgat 240
ctcgacaaat taaccggcaa catgatctcg aatcaggcca aatacgtctc cgatacgcgc 300
gacgtcctgc gggccatgaa gaagatgatt gacggtgtct acaaggtttg taagggcctc 360
gaaaagattc cgctgctcgg ccacttgtgg tcgtgggagc tcgcaatccc tatgtccggc 420
atcgcgatgg ccgttgtcgg cggcgcattg ctctatctaa cgattatgac gctgatgaat 480
gcgaccaacc tgaggggaat tctcggcagg ctgatcgaga tgttgacgac cttgccaaag 540
ttccccggcc tgcccgggtt gcccagcctg cccgacatca tcgacggcct ctggccgccg 600
aagttgcccg acattccgat ccccggcctg cccgacatcc cgggcctacc cgacttcaaa 660
tggccgccca cccccggcag cccgttgttc cccgacctcc cgtcgttccc agggttcccc 720
gggttcccgg agttccccgc catccccggg ttccccgcac tgcccgggtt gcccagcatt 780
cccaacttgt tccccggctt gccgggtctg ggcgacctgc tgcccggcgt aggcgatttg 840
ggcaagttac ccacctggac tgagctggcc gctttgcctg acttcttggg cggcttcgcc 900
ggcctgccca gcttgggttt tggcaatctg ctcagctttg ccagtttgcc caccgtgggt 960
caggtgaccg ccaccatggg tcagctgcaa cagctcgtgg cggccggcgg tggccccagc 1020
caactggcca gcatgggcag ccaacaagcg caactgatct cgtcgcaggc ccagcaagga 1080
ggccagcagc acgccaccct cgtgagcgac aagaaggaag acgaggaagg cgtggccgag 1140
gcggagcgtg cacccatcga cgctggcacc gcggccagcc aacgggggca ggaggggacc 1200
gtcctt 1206

<210> 8
<211> 309 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3865
<400> 8 

atgaccggat ttctcggtgt cgtgccttcg ttcctgaagg tgctggcggg catgcacaac 60 
gagatcgtgg gtgatatcaa aagggcgacc gatacggtcg ccgggattag cggacgagtt 120 
cagcttaccc atggttcgtt cacgtcgaaa ttcaatgaca cgctgcaaga gtttgagacc 180 
acccgtagca gcacgggcac gggtttgcag ggagtcacca gcggactggc caataatctg 240 
ctcgcagccg ccggcgccta cctcaaggcc gacgatggcc tagccggtgt tatcgacaag 300 
attttcggt 309



<210> 9 
<211> 849 
<212> DNA 

<213> Mycobacterium tuberculosis

<220>

<223> DNA sequence Rv3866
<400> 9

atgacgggtc cgtccgctgc aggccgcgcg ggcaccgccg acaacgtggt cggcgtcgag 60 
gtaaccatcg acggcatgtt ggtgatcgcc gatcggttac acctggttga tttccctgtc 120 
acgcttggga ttcggccgaa tatcccgcaa gaggatctgc gagacatcgt ctgggaacag 180 
gtgcagcgtg acctcacagc gcaaggggtg ctcgacctcc acggggagcc ccaaccgacg 240 
gtcgcggaga tggtcgaaac cctgggcagg ccagatcgga ccttggaggg tcgctggtgg 300
cggcgcgaca ttggcggcgt catggtgcgc ttcgtcgtgt gccgcagggg cgaccgccat 360
gtgatcgcgg cgcgcgacgg cgacatgctg gtgctgcagt tggtggcgcc gcaggtcggc 420 
ttggcgggca tggtgacagc ggtgctgggg cccgccgaac ccgccaacgt cgaacccctg 480 
acgggtgtgg caaccgagct agccgaatgc acaaccgcgt cccaattgac gcaatacggt 540 
atcgcaccgg cctcggcccg cgtctatgcc gagatcgtgg gtaacccgac cggctgggtg 600
gagatcgttg ccagccaacg ccaccccggc ggcaccacga cgcagaccga cgccgccgct 660 
ggcgtcctgg actccaagct cggtaggctg gtgtcgcttc cccgccgtgt tggaggcgac 720 
ctgtacggaa gcttcctgcc cggcactcag cagaacttgg agcgtgcgct ggacggcttg 780 
ctagagctgc tccctgcggg cgcttggcta gatcacacct cagatcacgc acaagcctcc 840 
tcccgaggc 849



<210> 10
<211> 552
<212> DNA

<213> Mycobacterium tuberculosis

<220>
<223> DNA sequence Rv3867 

<400> 10 

atggtggacc cgccgggcaa cgacgacgac cacggtgatc tcgacgccct cgatttctcc 60
gccgcccaca ccaacgaggc gtcgccgctg gacgccttag acgactatgc gccggtgcag 120
accgatgacg ccgaaggcga cctggacgcc ctccatgcgc tcaccgaacg cgacgaggag 180
ccggagctgg agttgttcac ggtgaccaac cctcaagggt cggtgtcggt ctcaaccctg 240
atggacggca gaatccagca cgtcgagctg acggacaagg cgaccagcat gtccgaagcg 300
cagctggccg acgagatctt cgttattgcc gatctggccc gccaaaaggc gcgggcgtcg 360
cagtacacgt tcatggtgga gaacatcggt gaactgaccg acgaagacgc agaaggcagc 420
gccctgctgc gggaattcgt ggggatgacc ctgaatctgc cgacgccgga agaggctgcc 480
gcagccgaag ccgaagtgtt cgccacccgc tacgatgtcg actacacctc ccggtacaag 540
gccgatgact ga 552



<210> 11 
<211> 1722 
<212> DNA 

<213> Mycobacterium tuberculosis
<220> 

<223> DNA sequence Rv3868 
<400> 11 

atgactgatc gcttggccag tctgttcgaa agcgccgtca gcatgttgcc gatgtcggag 60
gcgcggtcgc tagatctgtt caccgagatc accaactacg acgaatccgc ttgcgacgca 120
tggatcggcc ggatccggtg tggggacacc gaccgggtga cgctgtttcg cgcctggtat 180
tcgcgccgca atttcggaca gttgtcggga tcggtccaga tctcgatgag cacgttaaac 240
gccaggattg ccatcggggg gctgtacggc gatatcacct acccggtcac ctcgccgcta 300
gcgatcacca tgggctttgc cgcatgcgag gcagcgcaag gcaattacgc cgacgccatg 360
gaggccttag aggccgcccc ggtcgcgggt tccgagcacc tggtggcgtg gatgaaggcg 420
gttgtctacg gcgcggccga acgctggacc gacgtgatcg accaggtcaa gagtgctggg 480
aaatggccgg acaagttttf ggccggcgcg gccggtgtgg cgcacggggt tgccgcggca 540
aacctggcct tgttcaccga agccgaacgc cgactcaccg aggccaacga ctcgcccgcc 600
ggtgaggcgt gtgcgcgcgc catcgcctgg tatctggcga tggcacggcg cagccagggc 660
aacgaaagcg ccgcggtggc gctgctggaa tggttacaga ccactcaccc cgagcccaaa 720
gtggctgcgg cgctgaagga tccctcctac cggctgaaga cgaccaccgc cgaacagatc 780
gcatcccgcg ccgatccctg ggatccgggc agtgtcgtga ccgacaactc cggccgggag 840
cggctgctcg ccgaggccca agccgaactc gaccgccaaa ttgggctcac ccgggttaaa 900
aatcagattg aacgctaccg cgcggcgacg ctgatggccc gggtccgcgc cgccaagggt 960
atgaaggtcg cccagcccag caagcacatg atcttcaccg gaccgcccgg taccggcaag 1020
accacgatcg cgcgggtggt ggccaatatc ctggccggct taggcgtcat tgccgaaccc 1080
aaactcgtcg agacgtcgcg caaggacttc gtcgccgagt acgaggggca atcggcggtc 1140
aagaccgcta agacgatcga tcaggcgctg ggcggggtgc ttttcatcga cgaggcttat 1200
gcgctggtgc aggaaagaga cggccgcacc gatccgttcg gtcaagaggc gctggacacg 1260
ctQctggcgc ggatggagaa cgaccgggac cggctggtgg tgatcatcgc cgggtacagc 1320
tccgacatag atcggctgct ggaaaccaac gagggtctgc ggtcgcggtt cgccactcgc 1380
atcgagttcg acacctattc cbccgaggaa ctcctcgaga tcgccaacgt cattgccgct 1440
Qctgatgatt cggcgttgac cgcagaggcg gccgagaact ttcttcaggc cgccaagcag 1500
ttggagcagc gcatgttgcg cggccggcgc gccctggacg tcgccggcaa cggtcggtat 1560
gcgcgccagc tggtggaggc cagcgagcaa tgccgggaca tgcgtctagc ccaggtcctc 1620
gatatcgaca ccctcgacga agaccggctt cgcgagatca acggctcaga tatggcggag 1680
gctatcgccg cggtgcacgc acacctcaac atgagagaat ga 1722

<210> 12
<211> 1443
<212> DNA

<213> Mycobacterium tuberculosis
<220>

<223> DNA sequence Rv3869

<400> 12

atggggcttc gcctcaccac caaggttcag gttagcggct ggcgttttct gctgcgccgg 60
ctcgaacacg ccatcgtgcg ccgggacacc cggatgtttg acgacccgct gcagttctac 120
agccgctcga tcgctcttgg catcgtcgtc gcggtcctga ttctggcggg tgccgcgctg 180
ctggcgtact tcaaaccaca aggcaaactc ggcggcacca gcctgttcac cgaccgcgcg 240
accaaccagc tttacgtgct gctgtccgga cagttgcatc cggtctacaa cctgacttcg 300
gcgcggctgg tgctgggcaa tccggccaac ccggccaccg tgaagtcctc cgaactgagc 360
aagctgccga tgggccagac cgttggaatc cccggcgccc cctacgccac gcctgtttcg 420
gcgggcagca cctcgatctg gaccctatgc gacaccgtcg cccgagccga ctccacttcc 480
ccggtagtgc agaccgcggt catcgcgatg ccgttggaga tcgatgcttc gatcgatccg 540
ctccagtcac acgaagcggt gctggtgtcc taccagggcg aaacctggat cgtcacaact 600
aagggacgcc acgccataga tctgaccgac cgcgccctca cctcgtcgat ggggataccg 660
gtgacggcca ggccaacccc gatctcggag ggcatgttca acgcgctgcc tgatatgggg 720
ccctggcagc tgccgccgat accggcggcg ggcgcgccca attcgcttgg cctacctgat 780
gatctagtga tcggatcggt cttccagatc cacaccgaca agggcccgca atactatgtg 840
gtgctgcccg acggcatcgc gcaggtcaac gcgacaaccg ctgcggcgct gcgcgccacc 900
caggcgcacg ggctggtcgc gccaccggca atggtgccca gtctggtcgt cagaatcgcc 960
gaacgggtat acccctcacc gctacccgat gaaccgctca agatcgtgtc ccggccgcag 1020
gatcccgcgc tgtgctggtc atggcaacgc agcgccggcg accagtcgcc gcagtcaacg 1080
gtgctgtccg gccggcatct gccgatatcg ccctcagcga tgaacatggg gatcaagcag 1140
atccacggga cggcgaccgt ttacctcgac ggcggaaaat tcgtggcact gcaatccccc 1200
gatcctcgat acaccgaatc gatgtactac atcgatccac agggcgtgcg ttatggggtg 1260
cctaacgcgg agacagccaa gtcgctgggc ctgagttcac cccaaaacgc gccctgggag 1320
atcgttcgtc tcctggtcga cggtccggtg ctgtcgaaag atgccgcact gctcgagcac 1380
gacacgctgc ccgctgaccc tagcccccga aaagttcccg ccggagcctc cggagccccc 1440
tga 1443



<210> 13 
<211> 2244 
<212> DNA 

<213> Mycobacterium tuberculosis
<220> 

<223> DNA sequence Rv3870 



<400> 13

atgacgacca agaagttcac tcccaccatt acccgtggcc cccggttgac cccgggcgag 60
atcagcctca cgccgcccga tgacctgggc atcgacatcc caccgtcggg cgtccaaaag 120
atccttccct acgtgatggg tggcgccatg ctcggcatga tcgccatcat ggtggccggc 180
ggcaccaggc agctgtcgcc gtacatgttg atgatgccgc tgatgatgat cgtgatgatg 240
gtcggcggtc tggccggtag caccggtggt ggcggcaaga aggtgcccga aatcaacgcc 300
gaccgcaagg agtacctgcg gtatttggca ggactacgca cccgagtgac gtcctcggcc 360
acctctcagg tggcgttctt ctcctaccac gcaccgcatc ccgaggatct gttgtcgatc 420
gtcggcaccc aacggcagtg gtcccggccg gccaacgccg acttctatgc ggccacccga 480
atcggtatcg gtgaccagcc ggcggtggat cgattattga agccggccgt cggcggggag 540
ttggccgccg ccagcgcagc acctcagccg ttcctggagc cggtcagtca tatgtgggtg 600
gtcaagtttc tacgaaccca tggattgatc catgactgcc cgaaactgct gcaactccgt 660
acctttccga ctatcgcgat cggcggggac ttggcggggg cagccggcct gatgacggcg 720
atgatctgtc acctagccgt gttccaccca ccggacctgc tgcagatccg ggtgctcacc 780
gaggaacccg acgaccccga ctggtcctgg ctcaaatggc ttccgcacgt acagcaccag 840
accgaaaccg atgcggccgg gtccacccgg ctgatcttca cgcgccagga aggtctgtcg 900
gacctggccg cgcgcgggcc acacgcaccc gattcgcttc ccggcggccc ctacgtagtc 960
gtcgtcgacc tgaccggcgg caaggctgga ttcccgcccg acggtagggc cggtgtcacg 1020
gtgatcacgt tgggcaacca tcgcggctcg gcctaccgca tcagggtgca cgaggatggg 1080
acggctgatg accggctccc taaccaatcg tttcgccagg tgacatcggt caccgatcgg 1140
atgtcgccgc agcaagccag ccgtatcgcg cgaaagttgg ccggatggtc catcacgggc 1200
accatcctcg acaagacgtc gcgggtccag aagaaggtgg ccaccgactg gcaccagctg 1260
gtcggtgcgc aaagtgtcga ggagataaca ccttcccgct ggaggatgta caccgacacc 1320
gaccgtgacc ggctaaagat cccgtttggt catgaactaa agaccggcaa cgtcatgtac 1380
ctggacatca aagagggcgc ggaattcggc gccggaccgc acggcatgct catcgggacc 1440
acggggtctg ggaagtccga attcctgcgc accctgatcc tgtcgctggt ggcaatgact 1500
catccagatc aggtgaatct cctgctcacc gacttcaaag gtggttcaac cttcctggga 1560
atggaaaagc ttccgcacac tgccgctgtc gtcaccaaca tggccgagga agccgagctc 1620
gtcagccgga tgggcgaggt gttgaccgga gaactcgatc ggcgccagtc gatcctccga 1680
caggccggga tgaaagtcgg cgcggccgga gccctgtccg gcgtggccga atacgagaag 1740
taccgcgaac gcggtgccga cctacccccg ctgccaacgc ttttcgtcgt cgtcgacgag 1800
ttcgccgagc tgttgcagag tcacccggac ttcatcgggc tgttcgaccg gatctgccgc 1860
gtcgggcggt cgctgagggt ccatctgctg ctggctaccc agtcgctgca gaccggcggt 1920
gttcgcatcg acaaactgga gccaaacctg acatatcgaa tcgcattgcg caccaccagc 1980
tctcatgaat ccaaggcggt aatcggcaca ccggaggcgc agtacatcac caacaaggag 2040
agcggtgtcg ggtttctccg ggtcggcatg gaagacccgg tcaagttcag caccttctac 2100
atcagtgggc catacatgcc gccggcggca ggcgtcgaaa ccaatggtga agccggaggg 2160
cccggtcaac agaccactag acaagccgcg cgcattcaca ggttcaccgc ggcaccggtt 2220
ctcgaggagg cgccgacacc gtga 2244

<210> 14 
<211> 1776 
<212> DNA 

<213> Mycobacterium tuberculosis
<220> 

<223> DNA sequence Rv3871 
<400> 14 

atgactgctg aaccggaagt acggacgctg cgcgaggttg tgctggacca gctcggcact 60 

gctgaatcgc gtgcgtacaa gatgtggctg ccgccgttga ccaatccggt cccgctcaac 120 

gagctcatcg cccgtgatcg gcgacaaccc ctgcgatttg ccctggggat catggatgaa 180 

ccgcgccgcc atctacagga tgtgtggggc gtagacgttt ccggggccgg cggcaacatc 240 

ggtattgggg gcgcacctca aaccgggaag tcgacgctac tgcagacgat ggtgatgtcg 300

gccgccgcca cacactcacc gcgcaacgtt cagttctatt gcatcgacct aggtggcggc 360 

gggctgatct atctcgaaaa ccttccacac gtcggtgggg tagccaatcg gtccgagccc 420

gacaaggtca accgggtggt cgcagagatg caagccgtca tgcggcaacg ggaaaccacc 480 

ttcaaggaac accgagtggg ctcgatcggg atgtaccggc agctgcgtga cgatccaagt 540 

caacccgttg cgtccgatcc atacggcgac gtctttctga tcatcgacgg atggcccggt 600

tttgtcggcg agttccccga ccttgagggg caggttcaag atctggccgc ccaggggctg 660

gcgttcggcg tccacgtcat catctccacg ccacgctgga cagagctgaa gtcgcgtgtt 720 

cgcgactacc tcggcaccaa gatcgagttc cggcttggtg acgtcaatga aacccagatc 780 

gaccggatta cccgcgagat cccggcgaat cgtccgggtc gggcagtgtc gatggaaaag 840 

caccatctga tgatcggcgt gcccaggttc gacggcgtgc acagcgccga taacctggtg 900 

gaggcgatca ccgcgggggt gacgcagatc gcttcccagc acaccgaaca ggcacctccg 960 

gtgcgggtcc tgccggagcg tatccacctg cacgaactcg acccgaaccc gccgggacca 1020

gagtccgact accgcactcg ctgggagatt ccgatcggct tgcgcgagac ggacctgacg 1080 

ccggctcact gccacatgca cacgaacccg cacctactga tcttcggtgc ggccaaatcg 1140 

ggcaagacga ccattgccca cgcgatcgcg cgcgccattt gtgcccgaaa cagtccccag 1200 

caggtgcggt tcatgctcgc ggactaccgc tcgggcctgc tggacgcggt gccggacacc 1260 

catctgctgg gcgccggcgc gatcaaccgc aacagcgcgt cgctagacga ggccgttcaa 1320

gcactggcgg tcaacctgaa gaagcggttg ccgccgaccg acctgacgac ggcgcagcta 1380 

cgctcgcgtt cgtggtggag cggatttgac gtcgtgcttc tggtcgacga ttggcacatg 1440

atcgtgggtg ccgccggggg gatgccgccg atggcaccgc tggccccgtt attgccggcg 1500

gcggcagata tcgggttgca catcattgtc acctgtcaga tgagccaggc ttacaaggca 1560

accatggaca agttcgtcgg cgccgcattc gggtcgggcg ctccgacaat gttcctttcg 1620 

ggcgagaagc aggaattccc atccagtgag ttcaaggtca agcggcgccc ccctggccag 1680 

gcatttctcg tctcgccaga cggcaaagag gtcatccagg ccccctacat cgagcctcca 1740 

gaagaagtgt tcgcagcacc cccaagcgcc ggttaa 1776 

<210> 15 
<211> 297 
<212> DNA 

<213> Mycobacterium tuberculosis
<220>

<223> PE coding sequence (Rv3872)



<400> 15 

atggaaaaaa tgtcacatga tccgatcgct gccgacattg gcacgcaagt gagcgacaac 60
gctctgcacg gcgtgacggc cggctcgacg gcgctgacgt cggtgaccgg gctggttccc 120
gcgggggccg atgaggtctc cgcccaagcg gcgacggcgt tcacatcgga gggcatccaa 180
ttgctggctt ccaatgcatc ggcccaagac cagctccacc gtgcgggcga agcggtccag 240
gacgtcgccc gcacctattc gcaaatcgac gacggcgccg ccggcgtctt cgccgaa 297

<210> 16
<211> 1104 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> PPE coding sequence (Rv3873)
<400> 16 

atgctgtggc acgcaatgcc accggagcta aataccgcac ggctgatggc cggcgcgggt 60 

ccggctccaa tgcttgcggc ggccgcggga tggcagacgc tttcggcggc tctggacgct 120 

caggccgtcg agttgaccgc gcgcctgaac tctctgggag aagcctggac tggaggtggc 180 

agcgacaagg cgcttgcggc tgcaacgccg atggtggtct ggctacaaac cgcgtcaaca 240 

caggccaaga cccgtgcgat gcaggcgacg gcgcaagccg cggcatacac ccaggccatg 300

gccacgacgc cgtcgctgcc ggagatcgcc gccaaccaca tcacccaggc cgtccttacg 360

gccaccaact tcttcggtat caacacgatc ccgatcgpgt tgaccgagat ggattatttc 420 

atccgtatgt ggaaccaggc agccctggca atggaggtct accaggccga gaccgcggtt 480 

aacacgcttt tegagaagct cgagccgatg gcgtcgatcc ttgatcccgg cgcgagccag 540

agcacgacga acccgatctt cggaatgccc tcccctggca gctcaacacc ggttggccag 600 

ttgccgccgg cggctaccca gaccctcggc caactgggtg agatgagcgg cccgatgcag 660 

cagctgaccc agccgctgca gcaggtgacg tcgttgttca gccaggtggg cggcaccggc 720

ggcggcaacc cagccgacga ggaagccgcg cagatgggcc tgctcggcac cagtccgctg 780

tcgaaccatc cgctggctgg tggatcaggc cccagcgcgg gcgcgggcct gctgcgcgcg 840

gagtcgctac ctggcgcagg tgggtcgttg acccgcacgc cgctgatgtc tcagctgatc 900 

gaaaagccgg ttgccccctc ggtgatgccg gcggctgctg ccggatcgtc ggcgacgggt 960 

ggcgccgctc cggtgggtgc gggagcgatg ggccagggtg cgcaatccgg cggctccacc 1020 

aggccgggtc tggtcgcgcc ggcaccgctc gcgcaggagc gtgaagaaga cgacgaggac 1080 

gactgggacg aagaggacga ctgg 1104

<210> 17
<211> 300 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> CFP-10 coding sequence (Rv3874) 
<400> 17 

atggcagaga tgaagaccga tgccgctacc ctcgcgcagg aggcaggtaa tttcgagcgg 60

atctccggcg acctgaaaac ccagatcgac caggtggagt cgacggcagg ttcgttgcag 120 
ggccagtggc gcggcgcggc ggggacggcc gcccaggccg cggtggtgcg cttccaagaa 180 
gcagccaata agcagaagca ggaactcgac gagatctcga cgaatattcg tcaggccggc 240
gtccaatact cgagggccga cgaggagcag cagcaggcgc tgtcctcgca aatgggcttc 300

<210> 18 

<211> 285

<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> ESAT-6 coding sequence (Rv3875)



<400> 18 

atgacagagc agcagtggaa tttcgcgggt atcgaggccg cggcaagcgc aatccaggga 60
aatgtcacgt ccattcattc cctccttgac gaggggaagc agtccctgac caagctcgca 120
gcggcctggg gcggtagcgg ttcggaggcg taccagggtg tccagcaaaa atgggacgcc 180
acggctaccg agctgaacaa cgcgctgcag aacctggcgc ggacgatcag cgaagccggt 240
caggcaatgg cttcgaccga aggcaacgtc actgggatgt tcgca 285

<210> 19 
<211> 2001
<212> DNA

<213> Mycobacterium tuberculosis
<220>

<223> DNA sequence Rv3876

<400> 19
atggcggccg actacgacaa gctcttccgg ccgcacgaag gtatggaagc tccggacgat 60
atggcagcgc agccgttctt cgaccccagt gcttcgtttc cgccggcgcc cgcatcggca 120
aacctaccga agcccaacgg ccagactccg cccccgacgt ccgacgacct gtcggagcgg 180
ttcgtgtcgg ccccgccgcc gccaccccca cccccacctc cgcctccgcc aactccgatg 240
ccgatcgccg caggagagcc gccctcgccg gaaccggccg catctaaacc acccacaccc 300
cccatgccca tcgccggacc cgaaccggcc ccacccaaac cacccacacc ccccatgccc 360
atcgccggac ccgaaccggc cccacccaaa ccacccacac ctccgatgcc catcgccgga 420
cctgcaccca ccccaaccga atcccagttg gcgcccccca gaccaccgac accacaaacg 480
ccaaccggag cgccgcagca accggaatca ccggcgcccc acgtaccctc gcacgggcca 540
catcaacccc ggcgcaccgc accagcaccg ccctgggcaa agatgccaat cggcgaaccc 600
ccgcccgctc cgtccagacc gtctgcgtcc ccggccgaac caccgacccg gcctgccccc 660
caacactccc gacgtgcgcg ccggggtcac cgctatcgca cagacaccga acgaaacgtc 720
gggaaggtag caactggtcc atccatccag gcgcggctgc gggcagagga agcatccggc 780
gcgcagctcg cccccggaac ggagccctcg ccagcgccgt tgggccaacc gagatcgtat 840
ctggctccgc ccacccgccc cgcgccgaca gaacctcccc ccagcccctc gccgcagcgc 900
aactccggtc ggcgtgccga gcgacgcgtc caccccgatt tagccgccca acatgccgcg 960
gcgcaacctg attcaattac ggccgcaacc actggcggtc gtcgccgcaa gcgtgcagcg 1020
ccggatctcg acgcgacaca gaaatcctta aggccggcgg ccaaggggcc gaaggtgaag 1080
aaggtgaagc cccagaaacc gaaggccacg aagccgccca aagtggtgtc gcagcgcggc 1140
tggcgacatt gggtgcatgc gttgacgcga atcaacctgg gcctgtcacc cgacgagaag 1200
tacgagctgg acctgcacgc tcgagtccgc cgcaatcccc gcgggtcgta tcagatcgcc 1260
gtcgtcggtc tcaaaggtgg ggctggcaaa accacgctga cagcagcgtt ggggtcgacg 1320
ttggctcagg tgcgggccga ccggatcctg gctctagacg cggatccagg pgccggaaac 1380
ctcgccgatc gggtagggcg acaatcgggc gcgaccatcg ctgatgtgct tgcagaaaaa 1440
gagctgtcgc actacaacga catccgcgca cacactagcg tcaatgcggt caatctggaa 1500
gtgctgccgg caccggaata cagctcggcg cagcgcgcgc tcagcgacgc cgactggcat 1560
ttcatcgccg atcctgcgtc gaggttttac aacctcgtct tggctgattg tggggccggc 1620
ttcttcgacc cgctgacccg cggcgtgctg tccacggtgt ccggtgtcgt ggtcgtggca 1680
agtgtctcaa tcgacggcgc acaacaggcg tcggtcgcgt tggactggtt gcgcaacaac 1740
ggttaccaag atttggcgag ccgcgcatgc gtggtcatca atcacatcat gccgggagaa 1800
cccaatgtcg cagttaaaga cctggtgcgg catttcgaac agcaagttca acccggccgg 1860
gtcgtggtca tgccgtggga caggcacatt gcggccggaa ccgagatttc actcgacttg 1920
ctcgacccta tctacaagcg caaggtcctc gaattggccg cagcgctatc cgacgatttc 1980
gagagggctg gacgtcgttg a 2001

<210> 20
<211> 1536
<212> DNA

<213> Mycobacterium tuberculosis
<220>

<223> DNA sequence Rv3877

<400> 20
ttgagcgcac ctgctgttgc tgctggtcct accgccgcgg gggcaaccgc tgcgcggcct 60
gccaccaccc gggtgacgat cctgaccggc agacggatga ccgatttggt actgccagcg 120
gcggtgccga tggaaactta tattgacgac accgtcgcgg tgctttccga ggtgttggaa 180
gacacgccgg ctgatgtact cggcggcttc gactttaccg cgcaaggcgt gtgggcgttc 240
gctcgtcccg gatcgccgcc gctgaagctc gaccagtcac tcgatgacgc cggggtggtc 300
gacgggtcac tgctgactct ggtgtcagtc agtcgcaccg agcgctaccg accgttggtc 360
gaggatgtca tcgacgcgat cgccgtgctt gacgagtcac ctgagttcga ccgcacggca 420
ttgaatcgct ttgtgggggc ggcgatcccg cttttgaccg cgcccgtcat cgggatggcg 480
atgcgggcgt ggtgggaaac tgggcgtagc ttgtggtggc cgttggcgat tggcatcctg 540
gggatcgctg tgctggtagg cagcttcgtc gcgaacaggt tctaccagag cggccacctg 600
gccgagtgcc tactggtcac gacgtatctg ctgatcgcaa ccgccgcagc gctggccgtg 660
ccgttgccgc gcggggtcaa ctcgttgggg gcgccacaag ttgccggcgc cgctacggcc 720
gtgctgtttt tgaccttgat gacgcggggc ggccctcgga agcgtcatga gttggcgtcg 780
tttgccgtga tcaccgctat cgcggtcatc gcggccgccg ctgccttcgg ctatggatac 840
caggactggg tccccgcggg ggggatcgca ttcgggctgt tcattgtgac gaatgcggcc 900
aagctgaccg tcgcggtcgc gcggatcgcg ctgccgccga ttccggtacc cggcgaaacc 960
gtggacaacg aggagttgct cgatcccgtc gcgaccccgg aggctaccag cgaagaaacc 1020
ccgacctggc aggccatcat cgcgtcggtg cccgcgtccg cggtccggct caccgagcgc 1080
agcaaactgg ccaagcaact tctgatcgga tacgtcacgt cgggcaccct gattctggct 1140
gccggtgcca tcgcggtcgt ggtgcgcggg cacttctttg tacacagcct ggtggtcgcg 1200
ggtttgatca cgaccgtctg cggatttcgc tcgcggcttt acgccgagcg ctggtgtgcg 1260
tgggcgttgc tggcggcgac ggtcgcgatt ccgacgggtc tgacggccaa actcatcatc 1320
tggtacccgc actatgcctg gctgttgttg agcgtctacc tcacggtagc cctggttgcg 1380
ctcgtggtgg tcgggtcgat ggctcacgtc cggcgcgttt caccggtcgt aaaacgaact 1440
ctggaattga tcgacggcgc catgatcgct gccatcattc ccatgctgct gtggatcacc 1500
ggggtgtacg acacggtccg caatatccgg ttctga 1536



<210> 21 
<211> 840 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3878



<400> 21 

atggctgaac cgttggccgt cgatcccacc ggcttgagcg cagcggccgc gaaattggcc 60
ggcctcgttt ttccgcagcc tccggcgccg atcgcggtca gcggaacgga ttcggtggta 120
gcagcaatca acgagaccat gccaagcatc gaatcgctgg tcagtgacgg gctgcccggc 180
gtgaaagccg ccctgactcg aacagcatcc aacatgaacg cggcggcgga cgtctatgcg 240
aagaccgatc agtcactggg aaccagtttg agccagtatg cattcggctc gtcgggcgaa 300
ggcctggctg gcgtcgcctc ggtcggtggt cagccaagtc aggctaccca gctgctgagc 360
acacccgtgt cacaggtcac gacccagctc ggcgagacgg ccgctgagct ggcaccccgt 420
gttgttgcga cggtgccgca actcgttcag ctggctccgc acgccgttca gatgtcgcaa 480
aacgcatccc ccatcgctca gacgatcagt caaaccgccc aacaggccgc ccagagcgcg 540
cagggcggca gcggcccaat gcccgcacag cttgccagcg ctgaaaaacc ggccaccgag 600
caagcggagc cggtccacga agtgacaaac gacgatcagg gcgaccaggg cgacgtgcag 660
ccggccgagg tcgttgccgc ggcacgtgac gaaggcgccg gcgcatcacc gggccagcag 720
cccggcgggg gcgttcccgc gcaagccatg gataccggag ccggtgcccg cccagcggcg 780
agtccgctgg cggcccccgt cgatccgtcg actccggcac cctcaacaac cacaacgttg 840



<210> 22 
<211> 2187 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3879c



<400> 22

atgagtatta ccaggccgac gggcagctat gccagacaga tgctggatcc gggcggctgg 60 
gtggaagccg atgaagacac tttctatgac cgggcccagg aatatagcca ggttttgcaa 120 
agggtcaccg atgtattgga cacctgccgc cagcagaaag gccacgtctt cgaaggcggc 180 






ctatggtccg gcggcgccgc caatgctgcc aacggcgccc tgggtgcaaa catcaatcaa 240 
ttgatgacgc tgcaggatta tctcgccacg gtgattacct ggcacaggca tattgccggg 300 
ttgattgagc aagctaaatc cgatatcggc aataatgtgg atggcgctca acgggagatc 360 
gatatcctgg agaatgaccc tagcctggat gctgatgagc gccataccgc catcaattca 420
ttggtcacgg cgacgcatgg ggccaatgtc agtctggtcg ccgagaccgc tgagcgggtg 480
ctggaatcca agaattggaa acctccgaag aaogcactcg aggatttgct tcagcagaag 540
tcgccgccac ccccagacgt gcctaccctg gtcgtgccaf ccccgggcac accgggcaca 600
ccgggaaccc cgatcacccc gggaaccccg atcaccccgg gaaccccaat cacacccatc 660 
ccgggagcgc cggtaactcc gatcacacca acgcccggca ctcccgtcac gccggtgacc 720 
ccgggcaagc cggtcacccc ggtgaccccg gtcaaaccgg gcacaccagg cgagccaacc 780 
ccgatcacgc cggtcacccc cccggtcgcc ccggccacac cggcaacccc ggccacgccc 840 
gttaccccag ctcccgctcc acacccgcag ccggctccgg caccggcgcc atcgcctggg 900 
ccccagccgg ttacaccggc cactcccggt ccgtdtggtc cagcaacacc gggcacccca 960 
gggggcgagc cggcgccgca cgtcaaaccc gcggcgttgg cggagcaacc tggtgtgccg 1020 
ggccagcatg cgggcggggg gacgcagtcg gggcctgccc atgcggacga atccgccgcg 1080 
tcggtgacgc cggctgcggc gtccggtgtc ccgggcgcac gggcggcggc cgccgcgccg 1140 
agcggtaccg ccgtgggagc gggcgcgcgt tcgagcgtgg gtacggccgc ggcctcgggc 1200 
gcggggtcgc atgctgccac tgggcgggcg ccggtggcta cctcggacaa ggcggcggca 1260 
ccgagcacgc gggcggcctc ggcgcggacg gcacctcctg cccgcccgcc gtcgaccgat 1320 
cacatcgaca aacccgatcg cagcgagtct gcagatgacg gtacgccggt gtcgatgatc 1380 
ccggtgtcgg cggctcgggc ggcacgcgac gccgccactg cagctgccag cgcccgccag 1440
cgtggccgcg gtgatgcgct gcggttggcg cgacgcatcg cggcggcgct caacgcgtcc 1500 
gacaacaacg cgggcgacta cgggttcttc tggatcaccg cggtgaccac cgacggttcc 1560 
atcgtcgtgg ccaacagcta tgggctggcc tacatacccg acgggatgga attgccgaat 1620 
aaggtgtact tggccagcgc ggatcacgca atcccggttg acgaaattgc acgctgtgcc 1680 
acctacccgg ttttggccgt gcaagcctgg gcggctttcc acgacatgac gctgcgggcg 1740 
gtgatcggta ccgcggagca gttggccagt tcggatcccg gtgtggccaa gattgtgctg 1800 
gagccagatg acattccgga gagcggcaaa atgacgggcc ggtcgcggct ggaggtcgtc 1860 
galccctcgg cggcggctca gctggccgac actaccgatc agcgtttgct ^9^=ttgttg 1920
ccgccggcgc cggtggafcgt caatccaccg ggcgatgagc ggcacatgct gtggttcgag 1980 
ctiatSalc ccItSccSg caccgctacc ggccgcgagg ccgctcatct gcgggcgttc 2040 
cgggcctacg ctgcccactc acaggagatt gccctgcacc aagcgcacac tgcgactgac 2100 
gcggccgtcc agcgtgtggc cgtcgcggac tggctgtact ggcaatacgt caccgggttg 2160 
ctcgaccggg ccctggccgc cgcatgc 2187

<210> 23 

<211> 345

<212> DNA

<213> Mycobacterium tuberculosis

<220>

<223> DNA sequence Rv3880c 

gjgagcatgg acgaattgga cccgcatgtc gcccgggcgt tgacgctggc ggcgcggttt 60 
?a|tcggccl ta|acgggac gctcaatcag atgaacaacg gatccttccg cgccaccgac 120 
galgccgaga ccgtcgaagt gacgatcaat gggcaccagt ggctcaccgg cctgcgcatc 180 
Lagatggtt tgctgaagaa gctgggtgcc gaggcggtgg ctcagcgggt c^^cgaggcg 240 
ctgcacStg c^caggccgc ggcgtccgcg tataacgacg cggcgggcga gcagctgacc 300 
gctgcgttat cggccatgtc ccgcgcgatg aacgaaggaa tggcc 345



<210> 24 
<211> 1380 
<212> DNA 

<213> Mycobacterium tuberculosis

<220>

<223> DNA sequence Rv3881c
<400> 24

atgacgcagt cgcagaccgt gacggtggat cagcaagaga ttttgaacag ggccaacgag 60
gtggaggccc cgatggcgga cccaccgact gatgtcccca tcacaccgtg cgaactcacg 120
gcggctaaaa acgccgccca acagctggta ttgtccgccg acaacatgcg ggaatacctg 180
gcggccggtg ccaaagagcg gcagcgtctg gcgacctcgc tgcgcaacgc ggccaaggcg 240
tatggcgagg ttgatgagga ggctgcgacc gcgctggaca acgacggcga aggaactgtg 300
caggcagaat cggccggggc cgtcggaggg gacagttcgg ccgaactaac cgatacgccg 360
agggtggcca cggccggtga acccaacttc atggatctca aagaagcggc aaggaagctc 420
gaaacgggcg accaaggcgc atcgctcgcg cactttgcgg atgggtggaa cactttcaac 480
ctgacgctgc aaggcgacgt caagcggttc cgggggtttg acaactggga aggcgatgcg 540
gctaccgctt gcgaggcttc gctcgatcaa caacggcaat ggatactcca catggccaaa 600
ttgagcgctg cgatggccaa gcaggctcaa tatgtcgcgc agctgcacgt gtgggctagg 660
cgggaacatc cgacttatga agacatagtc gggctcgaac ggctttacgc ggaaaaccct 720
tcggcccgcg accaaattct cccggtgtac gcggagtatc agcagaggtc ggagaaggtg 780
ctgaccgaat acaacaacaa ggcagccctg gaaccggtaa acccgccgaa gcctcccccc 840
gccatcaaga tcgacccgcc cccgcctccg caagagcagg gattgatccc tggcttcctg 900
atgccgccgt ctgacggctc cggtgtgacfc cccggtaccg ggatgccagc cgcaccgatg 960
gttccgccta ccggatcgcc gggtggtggc ctcccggctg acacggcggc gcagctgacg 1020
tcggctgggc gggaagccgc agcgctgtcg ggcgacgtgg cggtcaaagc ggcatcgctc 1080
ggtggcggtg gaggcggcgg ggtgccgtcg gcgccgttgg gatccgcgat cgggggcgcc 1140
gaatcggtgc ggcccgctgg cgctggtgac attgccggct taggccaggg aagggccggc 1200
ggcggcgccg cgctgggcgg cggtggcatg ggaatgccga tgggtgccgc gcatcaggga 1260
caagggggcg ccaagtccaa gggttctcag caggaagacg aggcgctcta caccgaggat 1320
cgggcatgga ccgaggccgt cattggtaac cgtcggcgcc aggacagtaa ggagtcgaag 1380



<210> 25 
<211> 1386 
<212> DNA

<213> Mycobacterium tuberculosis
<220>

<223> DNA sequence Rv3882c 



<400> 25 

atgagaaatc ctttagggct gcggttcagc accgggcacg ccttgcttgc ctccgcgttg 60
gccccgccat gcatcatcgc attcttggag acgcgctact ggtgggcggg gattgcgctg 120
gcctcgttgg gcgtcatcgt ggccacggtc actttctacg gccgccggat caccggctgg 180
gtggcggcgg tgtacgcgtg gttgcggcgg cgccgacggc ccccggattc ctcgtcagaa 240
cctgtggtcg gggccaccgt gaagccagga gatcacgttg cggtgcgctg gcaaggcgag 300
tttctggtcg ccgtaatcga gctcattccc cgaccattca cgccgacggt catcgtcgac 360
gggcaagccc acaccgacga catgctggac accggactgg tggaggagct cctgtcggtg 420
cactgtcccg acttggaggc cgatatcgtc tcagccggct accgcgtcgg caataccgca 480
gcgccggacg tggtgagtct gtatcagcag gtgatcggga cagacccggc gccggcgaac 540
cgccggacct ggatcgtgct gcgcgccgac ccggaacgca cccgcaaatc ggcgcagcgc 600
cgcgatgaag gcgtcgcagg actggcccgg tatttggtgg cgtccgcgac gcgcattgcc 660
gatcgactgg ctagccatgg tgtcgacgcg gtgtgtggcc gcagcttcga tgactacgac 720
cacgccaccg acatcggctt tgtgcgggag aaatggtcga tgatcaaggg gcgcgatgcc 780
tacactgccg cctacgcggc gcccggaggt ccggatgtat ggtggtcggc gcgcgcggac 840
cacaccatca ccagagtccg ggtcgcgccg gggatggccc cgcagtccac ggtgttgctg 900
accacggcgg acaagcccaa gacacccagg ggcttcgccc gcctatttgg cgggcagcgg 960
cccgcgctgc aaggccagca tctggtggcc aaccgccact gccagctgcc gatcgggtca 1020
gctggggtac tggtcggcga gacggtgaac cgatgcccgg tctacatgcc cttcgacgat 1080
gtcgacatcg ccctcaacct gggtgacgct cagacattca cccagttcgt ggtgcgtgcg 1140
gcggcggcag gtgcgatggt cacagtcggg ccacagttcg aggaatttgc ccggttgatc 1200
ggcgcacaca tcgggcagga ggtaaaggtg gcgtggccga atgcgacgac ctatctcggc 1260
ccgcatcccg gtattgaccg ggtgattctg cggcacaatg tgatcggtac cccgcggcat 1320
cggcagctgc cgattcgccg ggtttcccca cccgaggaaa gccgctacca gatggcgctg 1380
ccgaag 1386

<210> 26
<211> 1338 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3883c



gtgcaccgta tctttctgat cacggtggcg ctggcgttgc tcaccgcgtc gcccgcatcg 60 
gccatcacgc caccgccgat cgatccgggc gcgttgccgc ccgacgtgac gggcccggat 120 
cagcctaccg aacagcgcgt tttgtgcgcg tcgcccacca cgctgccggg gtccgggttc 180 
cacgatccgc cgtggagcaa cacgtatctg ggcgtggccg atgcccacaa gttcgcgacc 240 
ggggccgggg tgacggtggc ggtgatcgac accggtgtcg acgcttcgcc acgggtcccg 300 
|cigaac!?i gSggcgattt cgtcgaccag gccggtaacg gcctgtctga ctgtgatgcc 360 
catgggactc tcacagcatc catcatcgcg ggccggcccg cgcccaccga cgggttcgtc 420 
ggcgtcgcgc ccgacgctcg actgctctcg ctacgtcaga cgtctgaggc cttcgaaccg 480 
gtcggctcac aagccaaccc gaatgacccc aacgccaccc cggccgccgg ttccatccgc 540 
agtcttgccc gcgccgtggt gcacgccgcc aacctcggcg tgggtgtgat caacatcagt 600 
gaagccgcct gctacaaggt gagcaggccg atcgatgaaa cctcactggg tgcatccatc 660 
gactatgcgg tcaacgtcaa aggcgtggtg gtggtggtcg cggccggcaa caccggtggc 720 
gattgcgtac agaatccggc gccggacccg tccacacccg gcgacccacg cggctggaac 780
aatgtgcaga ccgttgtcac cccggcgtgg tacgcaccgc tggtgttaag cgtcggcggt 840
atcggccaga ccgggatgcc cagctcgttc tcgatgcacg gaccgtgggt ggacgtggcc 900 
gcgcccgcag aaaacatcgt cgcgctcggc gacaccggtg aaccggtgaa tgcgctgcaa 960 
ggccgggagg ggccggtacc catcgccggc acctcgtttg ccgcggcata tgtgtcgggt 1020 
SggcSccc ^IctLggca gcggttcccc gacctgacgc cggcgcagat catccaccgg 1080 
atcaccgcca ccgcgagaca ccccgggggc ggggtcgacg acctggtcgg cgccggcgtc 1140 
atcgatgcgg tggccgcgct gacgtgggac attccgcccg gccctgcttc ggcgccatac 1200 
aacitcagic gacttccacc cccggtggtg gagccgggtc ccgatcgtcg ^^^gattacg 1260 
gctgtggcgt tggtggccgt cggccttacg ttggccctgg gcctgggcgc gctggctaga 1320
cgggcgctga gccgccga 1338

<210> 27 
<211> 1857
<212> DNA

<213> Mycobacterium tuberculosis 
<220> 

<223> DNA sequence Rv3884c 

atgtcgagaa tggtggacac gatgggtgat ttactcactg cgcgccggca tttcgatcgg 60 
golatlaSga tcaagaatgg ccagggatgc gtggcggcgt tgcctgagtt ^gtggctgcc 120 
Iclgalgccg atccgtcgat ggccgacgcg tggctgggtc gtatcgcctg cggtgaccgc 180 
gatctSccl cgcttaagca gctcaacgcc catagcgagt ggctgcaccg cgagaccacg 240 
?ggatcggcc giacgttggc cgctgaggtc cagctgggac catccatcgg Satcacggtg 300 
acSgaclcat ILalgtggg gctggcgctg tcgtcggcgt tgacgatcgc 9SSS9^9^^^ 360 
gcglaggccg atgccctgtt agcaaaccgc gagctattgg attcgtggcg ^aactaccag 420 
?gLatcagc tggctcgggc gttcctgatg tacgtcacgc agcgatggcc cgacgtgttg 480 
tclacggccg cSaggl?lt gccgccacag gcgatcgtca tgccggcggt gaccgcgtcg 540 
at?tgtgcgc tggcagccca cgccgccgcc catctcgggc aggggcgagt ggccctggac 600
?qac?ggacc gStgiacgt gatcggacac agcaggtcat cggagcggtt cggcgccgac 660
gg^tllc^g SdStclg Lcggccgat attccgctgc tggtcgccga cttggcgtat 720 
gtgcggggga tggtgtaccg gcaactgcat gaggaggaca aggcccagat ctggctgtcg 780 
iSccSca tcLcgggg? ictcaccgac gccgccaaag aagccctggc S^acccgaac 840 
cti?gcttga ttgttaccga tgaacgaacc atcgccagcc gctccgaccg ttgggatgct 900 
tciacggcga aaagccgcga ccagctcgat gacgacaatg cagcgcagcg gcgcggcgag 960 
ct|ctlic?g agggccggga actgctggcc aaacaggtgg gcctggcggc Sgtcaagcaa 1020 
gcigtatcgg cgctggaaga ccaactcgag gtgcgcatga tgcgcctaga gcacggccta 1080 
ccggtggagg ggcagaccaa ccacatgttg ctggtggggc caccaggcac aggtaagaca 1140
accaccgctg aagcgctcgg caagatctac gccggcatgg ggatcgtgcg tcaccccgaa 1200 
attcgagaag ttcgccgatc ggacttctgt gggcactaca tcggggagtc aggacccaag 1260 
acgaacgagc tgatcgaaaa gtcactcggg cgaatcattt tcatggacga gttctactcg 1320 
ctgatcgaac gtcatcaaga cggaacaccg gacatgatcg gcatggaggc ggtcaatcaa 1380 
ctcctggttc aattggaaac acaccgattc gacttctgtt tcatcggggc cggctatgag 1440 
gatcaggtgg atgaattcct caccgtgaac ccgggtttgg ctggccggtt caaccgaaag 1500
ctgcggttcg agtcttattc gccggtggag atcgtcgaga ttggacaccg ctacgctaca 1560 
ccgcgcgcca gccagctcga tgacgccgca cgggaggtat tcctcgacgc ggtcaccacc 1620 
atccgtaact acaccacccc tagtgggcag cacggtatcg acgctatgca aaacggtcgg 1680 
ttcgcccgca acgtgatcga acgcgccgaa gggttccggg acaccdgggt ggttgcgcaa 1740 
aaacgtgcgg gccaaccggt atcggttcag gatctgcaga tcatcaccgc caccgacatc 1800 
gatgccgcga tacgcagcgt gtgctcagac aaccgagaca tggccgcgat cgtttgg 1857 

<210> 28 
<211> 1611 
<212> DNA 

<213> Mycobacterium tuberculosis
<220> 

<223> DNA sequence Rv3885c 

ttgacgtcca agctgaccgg gttcagcccg cgcagtgcga ggcgggtcgc cggggtgtgg 60 
acagtgttcg tgctcgcgtc ggcgggctgg gcgctgggcg gccagctagg tgcggtcatg 120 
gctgtcgtgg tcggcgtcgc cttggtgttc gtacagtggt ggggtcagcc ggcgtggtcg 180 
tgggcggtac tggggctgcg gggtcggcgt cccgtcaaat ggaatgaccc aattaccttg 240 
gccLcLcc ga?ccggigg tggcgtccgc gtgcaagacg gtgtcgcggt 3gtggcggtg 300 
caacttctcg gccgagcgca ccgggcgact acggtcaccg ggtcggtgac cgtagaaagc 360 
gacaacgtga ttgacgtcgt tgagctcgcg ccgttgctgc gccacccgct ggacctggaa 420 
ctcgattcaa tcagcgtcgt caccttcggc tcgcgaaccg gcaccgtcgg cgattacccg 480 
cgggtgtatg acgcggagat cggtacgccg ccgtatgccg ggcggcgcga aacgtggctg 540 
alcatgcggc ttccggtgat cggcaacacc caagctttac gctggcgtac cagcgttggg 600
gccgclgcSa tttcggtcgc ccaacgcgtt gccagctccc tgcgctgtca gggcttgcgc 660 
Iccaaactgg ccaccgcaac agacttggct gagcttgatc gccggctggg gtcggacgcg 720
gtagccggga gtgcgcagcg ctggaaagct atccgcggtg aagccgggtg gatgacgacg 780 
?at|cg?a?c Sglclgaggc gatttcgtcg cgggttctct cgcaagcctg 9-^g=tgcgt 840 
gccgatgagg tcatccagaa cgtaacggtg tatccggacg cgacgtgcac cgcgaccatc 900 
Lcltgcgca caccgacgcc ggcgcctacc ccgcccagtg tgatcttgcg tcggctcaat 960 
ggtiaiclag ccgccgcggc tgcggccaac atgtgcgggc cacgtccaca cctacgcgga 1020 
Sgcggcgc? gcccgttgcc ggcgcagcta gtcaccgaga tcggaccgtc gggggtgttg 1080 
at^glcalgc ?gagcaacgg ggaccggctg atgattcccg ttaccgacgc cggtgagctg 1140 
tcgcgcgtct tcgtggccgc ggacgacacg atcgccaaga ggatcgtgat tcgcgtcgtc 1200 
ggSdgtg agJgStgtg tgtgcacact cgcgaccaag agcgttgggc cagcgtacgc 1260 
llgccgllgt tgagcatcgt cggcacacca cggcccgcgc cgcgcaccac tgtcggcgtc 1320 
gtigagtacg tgcggcggcg caagaacggc gatgacggca aatctgaagg cagcggtgtc 1380
?2Itcgcga tLcicccac gccacggcca gccagtgtca tcaccattgc ccgacccggt 1440 
LcLgctit ccgagagcga tcggcatggc ttcgaggtga ccatcgaaca aatcgatcgg 1500 
gcaacggtga aagtcggtgc ggcaggacaa aactggctgg ttgagatgga aatgttccgt 1560 
gcggagaacc gctatgtcag ccttgagccg gtcacgatgt cgataggccg g 1611

<210> 29 
<211> 620 
<212> DNA 

<213> Mycobacterium tuberculosis 



<220> 

<223> CFP-10 + ESAT-6 
<400> 29

atggcagaga tgaagaccga tgccgctacc ctcgcgcagg aggcaggtaa tttcgagcgg 60 

atctccggcg acctgaaaac ccagatcgac caggtggagt cgacggcagg ttcgttgcag 120 

ggccagtggc gcggcgcggc ggggacggcc gcccaggccg cggtggtgcg cttccaagaa 180 

gcagccaata agcagaagca ggaactcgac gagatctcga cgaatattcg tcaggccggc 240 

gtccaatact cgagggccga cgaggagcag cagcaggcgc tgtcctcgca aatgggcttc 300 

tgacccgcta atacgaaaag aaacggagca aaaacatgac agagcagcag tggaatttcg 360 

cgggtatcga ggccgcggca agcgcaatcc agggaaatgt cacgtccatt cattccctcc 420 

ttgacgaggg gaagcagtcc ctgaccaagc tcgcagcggc ctggggcggt agcggttcgg 480

aggcgtacca gggtgtccag caaaaatggg acgccacggc taccgagctg aacaacgcgc 540 

tgcagaacct ggcgcggacg atcagcgaag ccggtcaggc aatggcttcg accgaaggca 600 
acgtcactgg gatgttcgca 620

<210> 30 

<211> 21
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer SP6-BAC1 

<400> 30
agttagctca ctcattaggc a 21

<210> 31 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer T7-BAC1 

<400> 31
ggatgtgctg caaggcgatt a 21

<210> 32 

<211> 20
<212> DNA

<213> Artificial Sequence

<220>

<223> Description of Artificial Sequence: Primer esat-6F

<400> 32
gtcacgtcca ttcattccct 20

<210> 33 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: Primer esat-6R 

<400> 33
atcccagtga cgttgcctt 19

<210> 34
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer RD1
flanking region F

<400> 34 

gcagtgcaaa ggtgcagata 



<210> 35 
<211> 20 
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer RD1
flanking region R 

<400> 35 

gattgagaca cttgccacga 



<210> 36 
<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer
plcA.int.F 

<400> 36

caagttgggt ctggtcgaat 



<210> 37
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer 
plcA.int.R 

<400> 37 

gctacccaag gtctcctggt 



<210> 38 
<211> 153 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> Sequences at the junction RD1
<400> 38

caagacgagg ttgtaaaacc tcgacgcagg atcggcgatg aaatgccagt cggcgtcgct 60 

gagcgcgcgc tgcgccgagt cccattttgt cgctgatttg tttgaacagc gacgaaccgg 120 

tgttgaaaat gtcgcctggg tcggggattc cct 153

<210> 39

<211> 19
<212> DNA 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence: Primer RD5 
flanking region F 

<400> 39
gaatgccgac gtcatatcg 19

<210> 40

<211> 20

<212> DNA 

<213> Artificial Sequence 

<220>
<223> Description of Artificial Sequence: Primer RD5 
flanking region R 

<400> 40

cggccactga gttcgattat 20

<210> 41 
<211> 152 
<212> DNA 

<213> Mycobacterium tuberculosis 

<220>

<223> Sequence at the junction RD5



cctcgatgaa ccacctgaca tgaccccatc ctttccaaga actggagtct ccggacatgc 60
cggggcggtt cactgcccca ggtgtcctgg gtcgttccgt tgaccgtcga gtccgaacat 120
ccgtcattcc cggtggcagt cggtgcggtg ac 152



<210> 42 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence: Primer MiD1
flanking region F

<400> 42
cagccaacac caagtagacg 20



<210> 43 
<211> 20 
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: Primer MiD1
flanking region R

<400> 43

tctacctgca gtcgcttgtg 20 



<210> 44 
<211> 123 
<212> DNA 

<213> Mycobacterium tuberculosis 
<220> 

<223> Sequence at the junction MiD1
<400> 44 

cacctgacat gaccccatcc tttccaagaa ctggagtctc cggacatgcc ggggcggttc 60
agggacattc atgtccatct tctggcagat cagcagatcg cttgttctca gtgcaggtga 120
gtc 123



<210> 45 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer MiD2 
flanking region R

<400> 45 

gtccatcgag gatgtcgagt 

<210> 46
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer MiD2 
flanking region L 

<400> 46 

ctaggccatt ccgttgtctg 



<210> 47 
<211> 151 
<212> DNA 

<213> Mycobacterium tuberculosis 

<220> 

<223> Sequence at the junction MiD2 



<400> 47 

gctgcctact acgctcaacg ccagagacca gccgccggct gaggtctcag atcagagagt 60
ctccggactc accggggcgg ttcataaagg cttcgagacc ggacgggctg taggttcctc 120
aactgtgtgg cggatggtct gagcacttaa c 151

<210> 48 

<211> 15
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Primer MiD3
flanking region R 

<400> 48 

ggcgacgcca tttcc 

<210> 49 
<211> 19 
<212> DNA 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence: Primer MiD3 

flanking region L 
<400> 49 

aactgtcggg cttgctctt 

<210> 50 
<211> 181 
<212> DNA 

<213> Mycobacterium tuberculosis

<220>
<223> Sequence at the junction MiD3 

tggcgccggc acctccgttg ccaccgttgc cgccgctggt gggcgcggtg ccgttcgccc 60 
cSc?gaI?c gttcagggcc gggttcgccc tcagccgcta aacacgccga ^caagatcaa 120 
clagctacct gcccggtcaa ggttgaagag cccccatatc agcaagggcc cggtgtcggc 180 



<210> 51 
<211> 108 
<212> PRT 

<213> Mycobacterium tuberculosis 
<220>

<223> RV3861 - hypothetical protein 

Val Thr Trp Leu Ala Asp Pro Val Gly Asn Ser Arg Ile Ala Arg Ala
1 5 10 15

Gln Ala Cys Lys Thr Ser Ile Ser Ala Pro Ile Val Glu Ser Trp Arg
20 25 30

Ala Gln Arg Gly Ala Gln Cys Gly Gln Arg Glu Lys Ser Cys Arg Cys
35 40 45



wo 03/085098 



33/66 



PCT/IB03/01789 



Ser Arg Ala Val His Ile Gln Gly Ile Ser Pro Pro Leu Phe Arg Arg
50 55 60

Pro Leu Glu Pro Ala Val Gln Ala Ala Val Ala Ser Cys Arg Leu Gly
65 70 75 80

Arg His Pro Val Val Ala His Arg Val Thr Val Ala Leu Gly Gln Gly
85 90 95

Ser Gln Leu Ala Gln Arg Glu Cys Pro Arg Pro Ala
100 105



<210> 52

<211> 116 
<212> PRT 

<213> Mycobacterium tuberculosis 
<220> 

<223> WHIB6 - Possible transcriptional regulatory 
protein WHIB-like WHIB6 

Met Arg Tyr Ala Phe Ala Ala Glu Ala Thr Thr Cys Asn Ala Phe Trp
1 5 10 15

Arg Asn Val Asp Met Thr Val Thr Ala Leu Tyr Glu Val Pro Leu Gly
20 25 30

Val Cys Thr Gln Asp Pro Asp Arg Trp Thr Thr Thr Pro Asp Asp Glu
35 40 45

Ala Lys Thr Leu Cys Arg Ala Cys Pro Arg Arg Trp Leu Cys Ala Arg
50 55 60

Asp Ala Val Glu Ser Ala Gly Ala Glu Gly Leu Trp Ala Gly Val Val
65 70 75 80

Ile Pro Glu Ser Gly Arg Ala Arg Ala Phe Ala Leu Gly Gln Leu Arg
85 90 95

Ser Leu Ala Glu Arg Asn Gly Tyr Pro Val Arg Asp His Arg Val Ser
100 105 110

Ala Gln Ser Ala
115



<210> 53 
<211> 392 
<212> PRT 

<213> Mycobacterium tuberculosis 
<220> 

<223> RV3863 - hypothetical alanine rich protein

Met Ala Gly Glu Arg Lys Val Cys Pro Pro Ser Arg Leu Val Pro Ala
1 5 10 15

Asn Lys Gly Ser Thr Gln Met Ser Lys Ala Gly Ser Thr Val Gly Pro
20 25 30

Ala Pro Leu Val Ala Cys Ser Gly Gly Thr Ser Asp Val Ile Glu Pro
35 40 45

Arg Arg Gly Val Ala Ile Ile Gly His Ser Cys Arg Val Gly Thr Gln
50 55 60

Ile Asp Asp Ser Arg Ile Ser Gln Thr His Leu Arg Ala Val Ser Asp
65 70 75 80

Asp Gly Arg Trp Arg Ile Val Gly Asn Ile Pro Arg Gly Met Phe Val
85 90 95

Gly Gly Arg Arg Gly Ser Ser Val Thr Val Ser Asp Lys Thr Leu Ile
100 105 110

Arg Phe Gly Asp Pro Pro Gly Gly Lys Ala Leu Thr Phe Glu Val Val
115 120 125

Arg Pro Ser Asp Ser Ala Ala Gln His Gly Arg Val Gln Pro Ser Ala
130 135 140

Asp Leu Ser Asp Asp Pro Ala His Asn Ala Ala Pro Val Ala Pro Asp
145 150 155 160

Pro Gly Val Val Arg Ala Gly Ala Ala Ala Ala Ala Arg Arg Arg Glu
165 170 175

Leu Asp Ile Ser Gln Arg Ser Leu Ala Ala Asp Gly Ile Ile Asn Ala
180 185 190

Gly Ala Leu Ile Ala Phe Glu Lys Gly Arg Ser Trp Pro Arg Glu Arg
195 200 205

Thr Arg Ala Lys Leu Glu Glu Val Leu Gln Trp Pro Ala Gly Thr Ile
210 215 220

Ala Arg Ile Arg Arg Gly Glu Pro Thr Glu Pro Ala Thr Asn Pro Asp
225 230 235 240

Ala Ser Pro Gly Leu Arg Pro Ala Asp Gly Pro Ala Ser Leu Ile Ala
245 250 255

Gln Ala Val Thr Ala Ala Val Asp Gly Cys Ser Leu Ala Ile Ala Ala
260 265 270

Leu Pro Ala Thr Glu Asp Pro Glu Phe Thr Glu Arg Ala Ala Pro Ile
275 280 285

Leu Ala Asp Leu Arg Gln Leu Glu Ala Ile Ala Val Gln Ala Thr Arg
290 295 300

Ile Ser Arg Ile Thr Pro Glu Leu Ile Lys Ala Leu Gly Ala Val Arg
305 310 315 320

Arg His His Asp Glu Leu Met Arg Leu Gly Ala Thr Ala Pro Gly Ala
325 330 335

Thr Leu Ala Gln Arg Leu Tyr Ala Ala Arg Arg Arg Ala Asn Leu Ser
340 345 350

Thr Leu Glu Thr Ala Gln Ala Ala Gly Val Ala Glu Glu Met Ile Val
355 360 365

Gly Ala Glu Ala Glu Glu Glu Leu Pro Ala Glu Ala Thr Glu Ala Ile
370 375 380

Glu Ala Leu Ile Arg Gln Ile Asn
385 390



<210> 54 
<211> 402 
<212> PRT 

<213> Mycobacterium tuberculosis 

<220> 

<223> RV3864 - conserved hypothetical protein 
<400> 54 

Met Ala Ser Gly Ser Gly Leu Cys Lys Thr Thr Ser Asn Phe Ile Trp
1 5 10 15

Gly Gln Leu Leu Leu Leu Gly Glu Gly Ile Pro Asp Pro Gly Asp Ile
20 25 30

Phe Asn Thr Gly Ser Ser Leu Phe Lys Gln Ile Ser Asp Lys Met Gly
35 40 45

Leu Ala Ile Pro Gly Thr Asn Trp Ile Gly Gln Ala Ala Glu Ala Tyr
50 55 60

Leu Asn Gln Asn Ile Ala Gln Gln Leu Arg Ala Gln Val Met Gly Asp
65 70 75 80

Leu Asp Lys Leu Thr Gly Asn Met Ile Ser Asn Gln Ala Lys Tyr Val
85 90 95

Ser Asp Thr Arg Asp Val Leu Arg Ala Met Lys Lys Met Ile Asp Gly
100 105 110

Val Tyr Lys Val Cys Lys Gly Leu Glu Lys Ile Pro Leu Leu Gly His
115 120 125

Leu Trp Ser Trp Glu Leu Ala Ile Pro Met Ser Gly Ile Ala Met Ala
130 135 140

Val Val Gly Gly Ala Leu Leu Tyr Leu Thr Ile Met Thr Leu Met Asn
145 150 155 160

Ala Thr Asn Leu Arg Gly Ile Leu Gly Arg Leu Ile Glu Met Leu Thr
165 170 175

Thr Leu Pro Lys Phe Pro Gly Leu Pro Gly Leu Pro Ser Leu Pro Asp
180 185 190

Ile Ile Asp Gly Leu Trp Pro Pro Lys Leu Pro Asp Ile Pro Ile Pro
195 200 205

Gly Leu Pro Asp Ile Pro Gly Leu Pro Asp Phe Lys Trp Pro Pro Thr
210 215 220

Pro Gly Ser Pro Leu Phe Pro Asp Leu Pro Ser Phe Pro Gly Phe Pro
225 230 235 240

Gly Phe Pro Glu Phe Pro Ala Ile Pro Gly Phe Pro Ala Leu Pro Gly
245 250 255

Leu Pro Ser Ile Pro Asn Leu Phe Pro Gly Leu Pro Gly Leu Gly Asp
260 265 270

Leu Leu Pro Gly Val Gly Asp Leu Gly Lys Leu Pro Thr Trp Thr Glu
275 280 285

Leu Ala Ala Leu Pro Asp Phe Leu Gly Gly Phe Ala Gly Leu Pro Ser
290 295 300

Leu Gly Phe Gly Asn Leu Leu Ser Phe Ala Ser Leu Pro Thr Val Gly
305 310 315 320

Gln Val Thr Ala Thr Met Gly Gln Leu Gln Gln Leu Val Ala Ala Gly
325 330 335

Gly Gly Pro Ser Gln Leu Ala Ser Met Gly Ser Gln Gln Ala Gln Leu
340 345 350

Ile Ser Ser Gln Ala Gln Gln Gly Gly Gln Gln His Ala Thr Leu Val
355 360 365

Ser Asp Lys Lys Glu Asp Glu Glu Gly Val Ala Glu Ala Glu Arg Ala
370 375 380

Pro Ile Asp Ala Gly Thr Ala Ala Ser Gln Arg Gly Gln Glu Gly Thr
385 390 395 400

Val Leu



<210> 55 
<211> 103 
<212> PRT 

<213> Mycobacterium tuberculosis
<220>

<223> Rv3865 - conserved hypothetical protein

<400> 55

Met Thr Gly Phe Leu Gly Val Val Pro Ser Phe Leu Lys Val Leu Ala
1 5 10 15

Gly Met His Asn Glu Ile Val Gly Asp Ile Lys Arg Ala Thr Asp Thr
20 25 30

Val Ala Gly Ile Ser Gly Arg Val Gln Leu Thr His Gly Ser Phe Thr
35 40 45

Ser Lys Phe Asn Asp Thr Leu Gln Glu Phe Glu Thr Thr Arg Ser Ser
50 55 60

Thr Gly Thr Gly Leu Gln Gly Val Thr Ser Gly Leu Ala Asn Asn Leu
65 70 75 80

Leu Ala Ala Ala Gly Ala Tyr Leu Lys Ala Asp Asp Gly Leu Ala Gly
85 90 95

Val Ile Asp Lys Ile Phe Gly
100



<210> 56 
<211> 283 
<212> PRT 

<213> Mycobacterium tuberculosis 
<220> 

<223> RV3866 - conserved hypothetical protein 
<400> 56 

Met Thr Gly Pro Ser Ala Ala Gly Arg Ala Gly Thr Ala Asp Asn Val
1 5 10 15

Val Gly Val Glu Val Thr Ile Asp Gly Met Leu Val Ile Ala Asp Arg
20 25 30

Leu His Leu Val Asp Phe Pro Val Thr Leu Gly Ile Arg Pro Asn Ile
35 40 45

Pro Gln Glu Asp Leu Arg Asp Ile Val Trp Glu Gln Val Gln Arg Asp
50 55 60

Leu Thr Ala Gln Gly Val Leu Asp Leu His Gly Glu Pro Gln Pro Thr
65 70 75 80

Val Ala Glu Met Val Glu Thr Leu Gly Arg Pro Asp Arg Thr Leu Glu
85 90 95

Gly Arg Trp Trp Arg Arg Asp Ile Gly Gly Val Met Val Arg Phe Val
100 105 110

Val Cys Arg Arg Gly Asp Arg His Val Ile Ala Ala Arg Asp Gly Asp
115 120 125

Met Leu Val Leu Gln Leu Val Ala Pro Gln Val Gly Leu Ala Gly Met
130 135 140

Val Thr Ala Val Leu Gly Pro Ala Glu Pro Ala Asn Val Glu Pro Leu
145 150 155 160

Thr Gly Val Ala Thr Glu Leu Ala Glu Cys Thr Thr Ala Ser Gln Leu
165 170 175

Thr Gln Tyr Gly Ile Ala Pro Ala Ser Ala Arg Val Tyr Ala Glu Ile
180 185 190

Val Gly Asn Pro Thr Gly Trp Val Glu Ile Val Ala Ser Gln Arg His
195 200 205

Pro Gly Gly Thr Thr Thr Gln Thr Asp Ala Ala Ala Gly Val Leu Asp
210 215 220

Ser Lys Leu Gly Arg Leu Val Ser Leu Pro Arg Arg Val Gly Gly Asp
225 230 235 240

Leu Tyr Gly Ser Phe Leu Pro Gly Thr Gln Gln Asn Leu Glu Arg Ala
245 250 255

Leu Asp Gly Leu Leu Glu Leu Leu Pro Ala Gly Ala Trp Leu Asp His
260 265 270

Thr Ser Asp His Ala Gln Ala Ser Ser Arg Gly
275 280



<210> 57 
<211> 183 
<212> PRT 

<213> Mycobacterium tuberculosis

<220>

<223> Protein sequence Rv3867
<400> 57 

Met Val Asp Pro Pro Gly Asn Asp Asp Asp His Gly Asp Leu Asp Ala
1 5 10 15

Leu Asp Phe Ser Ala Ala His Thr Asn Glu Ala Ser Pro Leu Asp Ala
20 25 30

Leu Asp Asp Tyr Ala Pro Val Gln Thr Asp Asp Ala Glu Gly Asp Leu
35 40 45

Asp Ala Leu His Ala Leu Thr Glu Arg Asp Glu Glu Pro Glu Leu Glu
50 55 60

Leu Phe Thr Val Thr Asn Pro Gln Gly Ser Val Ser Val Ser Thr Leu
65 70 75 80

Met Asp Gly Arg Ile Gln His Val Glu Leu Thr Asp Lys Ala Thr Ser
85 90 95

Met Ser Glu Ala Gln Leu Ala Asp Glu Ile Phe Val Ile Ala Asp Leu
100 105 110

Ala Arg Gln Lys Ala Arg Ala Ser Gln Tyr Thr Phe Met Val Glu Asn
115 120 125

Ile Gly Glu Leu Thr Asp Glu Asp Ala Glu Gly Ser Ala Leu Leu Arg
130 135 140

Glu Phe Val Gly Met Thr Leu Asn Leu Pro Thr Pro Glu Glu Ala Ala
145 150 155 160

Ala Ala Glu Ala Glu Val Phe Ala Thr Arg Tyr Asp Val Asp Tyr Thr
165 170 175

Ser Arg Tyr Lys Ala Asp Asp
180



<210> 58 
<211> 573 
<212> PRT 
<213> Mycobacterium tuberculosis

<220> 

<223> Protein sequence Rv3868 
<400> 58 

Met Thr Asp Arg Leu Ala Ser Leu Phe Glu Ser Ala Val Ser Met Leu
1 5 10 15

Pro Met Ser Glu Ala Arg Ser Leu Asp Leu Phe Thr Glu Ile Thr Asn
20 25 30

Tyr Asp Glu Ser Ala Cys Asp Ala Trp Ile Gly Arg Ile Arg Cys Gly
35 40 45

Asp Thr Asp Arg Val Thr Leu Phe Arg Ala Trp Tyr Ser Arg Arg Asn
50 55 60

Phe Gly Gln Leu Ser Gly Ser Val Gln Ile Ser Met Ser Thr Leu Asn
65 70 75 80

Ala Arg Ile Ala Ile Gly Gly Leu Tyr Gly Asp Ile Thr Tyr Pro Val
85 90 95

Thr Ser Pro Leu Ala Ile Thr Met Gly Phe Ala Ala Cys Glu Ala Ala
100 105 110

Gln Gly Asn Tyr Ala Asp Ala Met Glu Ala Leu Glu Ala Ala Pro Val
115 120 125

Ala Gly Ser Glu His Leu Val Ala Trp Met Lys Ala Val Val Tyr Gly
130 135 140

Ala Ala Glu Arg Trp Thr Asp Val Ile Asp Gln Val Lys Ser Ala Gly
145 150 155 160

Lys Trp Pro Asp Lys Phe Leu Ala Gly Ala Ala Gly Val Ala His Gly
165 170 175

Val Ala Ala Ala Asn Leu Ala Leu Phe Thr Glu Ala Glu Arg Arg Leu
180 185 190

Thr Glu Ala Asn Asp Ser Pro Ala Gly Glu Ala Cys Ala Arg Ala Ile
195 200 205

Ala Trp Tyr Leu Ala Met Ala Arg Arg Ser Gln Gly Asn Glu Ser Ala
210 215 220

Ala Val Ala Leu Leu Glu Trp Leu Gln Thr Thr His Pro Glu Pro Lys
225 230 235 240

Val Ala Ala Ala Leu Lys Asp Pro Ser Tyr Arg Leu Lys Thr Thr Thr
245 250 255

Ala Glu Gln Ile Ala Ser Arg Ala Asp Pro Trp Asp Pro Gly Ser Val
260 265 270

Val Thr Asp Asn Ser Gly Arg Glu Arg Leu Leu Ala Glu Ala Gln Ala
275 280 285

Glu Leu Asp Arg Gln Ile Gly Leu Thr Arg Val Lys Asn Gln Ile Glu
290 295 300

Arg Tyr Arg Ala Ala Thr Leu Met Ala Arg Val Arg Ala Ala Lys Gly
305 310 315 320

Met Lys Val Ala Gln Pro Ser Lys His Met Ile Phe Thr Gly Pro Pro
325 330 335

Gly Thr Gly Lys Thr Thr Ile Ala Arg Val Val Ala Asn Ile Leu Ala
340 345 350

Gly Leu Gly Val Ile Ala Glu Pro Lys Leu Val Glu Thr Ser Arg Lys
355 360 365

Asp Phe Val Ala Glu Tyr Glu Gly Gln Ser Ala Val Lys Thr Ala Lys
370 375 380

Thr Ile Asp Gln Ala Leu Gly Gly Val Leu Phe Ile Asp Glu Ala Tyr
385 390 395 400

Ala Leu Val Gln Glu Arg Asp Gly Arg Thr Asp Pro Phe Gly Gln Glu
405 410 415

Ala Leu Asp Thr Leu Leu Ala Arg Met Glu Asn Asp Arg Asp Arg Leu
420 425 430

Val Val Ile Ile Ala Gly Tyr Ser Ser Asp Ile Asp Arg Leu Leu Glu
435 440 445

Thr Asn Glu Gly Leu Arg Ser Arg Phe Ala Thr Arg Ile Glu Phe Asp
450 455 460

Thr Tyr Ser Pro Glu Glu Leu Leu Glu Ile Ala Asn Val Ile Ala Ala
465 470 475 480

Ala Asp Asp Ser Ala Leu Thr Ala Glu Ala Ala Glu Asn Phe Leu Gln
485 490 495

Ala Ala Lys Gln Leu Glu Gln Arg Met Leu Arg Gly Arg Arg Ala Leu
500 505 510

Asp Val Ala Gly Asn Gly Arg Tyr Ala Arg Gln Leu Val Glu Ala Ser
515 520 525

Glu Gln Cys Arg Asp Met Arg Leu Ala Gln Val Leu Asp Ile Asp Thr
530 535 540

Leu Asp Glu Asp Arg Leu Arg Glu Ile Asn Gly Ser Asp Met Ala Glu
545 550 555 560

Ala Ile Ala Ala Val His Ala His Leu Asn Met Arg Glu
565 570



<210> 59 

<211> 480 
<212> PRT 

<213> Mycobacterium tuberculosis

<220>

<223> Protein sequence Rv3869
<400> 59

Met Gly Leu Arg Leu Thr Thr Lys Val Gln Val Ser Gly Trp Arg Phe
1 5 10 15

Leu Leu Arg Arg Leu Glu His Ala Ile Val Arg Arg Asp Thr Arg Met
20 25 30

Phe Asp Asp Pro Leu Gln Phe Tyr Ser Arg Ser Ile Ala Leu Gly Ile
35 40 45

Val Val Ala Val Leu Ile Leu Ala Gly Ala Ala Leu Leu Ala Tyr Phe
50 55 60

Lys Pro Gln Gly Lys Leu Gly Gly Thr Ser Leu Phe Thr Asp Arg Ala
65 70 75 80

Thr Asn Gln Leu Tyr Val Leu Leu Ser Gly Gln Leu His Pro Val Tyr
85 90 95

Asn Leu Thr Ser Ala Arg Leu Val Leu Gly Asn Pro Ala Asn Pro Ala
100 105 110

Thr Val Lys Ser Ser Glu Leu Ser Lys Leu Pro Met Gly Gln Thr Val
115 120 125

Gly Ile Pro Gly Ala Pro Tyr Ala Thr Pro Val Ser Ala Gly Ser Thr
130 135 140

Ser Ile Trp Thr Leu Cys Asp Thr Val Ala Arg Ala Asp Ser Thr Ser
145 150 155 160

Pro Val Val Gln Thr Ala Val Ile Ala Met Pro Leu Glu Ile Asp Ala
165 170 175

Ser Ile Asp Pro Leu Gln Ser His Glu Ala Val Leu Val Ser Tyr Gln
180 185 190

Gly Glu Thr Trp Ile Val Thr Thr Lys Gly Arg His Ala Ile Asp Leu
195 200 205

Thr Asp Arg Ala Leu Thr Ser Ser Met Gly Ile Pro Val Thr Ala Arg
210 215 220

Pro Thr Pro Ile Ser Glu Gly Met Phe Asn Ala Leu Pro Asp Met Gly
225 230 235 240

Pro Trp Gln Leu Pro Pro Ile Pro Ala Ala Gly Ala Pro Asn Ser Leu
245 250 255

Gly Leu Pro Asp Asp Leu Val Ile Gly Ser Val Phe Gln Ile His Thr
260 265 270

Asp Lys Gly Pro Gln Tyr Tyr Val Val Leu Pro Asp Gly Ile Ala Gln
275 280 285

Val Asn Ala Thr Thr Ala Ala Ala Leu Arg Ala Thr Gln Ala His Gly
290 295 300

Leu Val Ala Pro Pro Ala Met Val Pro Ser Leu Val Val Arg Ile Ala
305 310 315 320

Glu Arg Val Tyr Pro Ser Pro Leu Pro Asp Glu Pro Leu Lys Ile Val
325 330 335

Ser Arg Pro Gln Asp Pro Ala Leu Cys Trp Ser Trp Gln Arg Ser Ala
340 345 350

Gly Asp Gln Ser Pro Gln Ser Thr Val Leu Ser Gly Arg His Leu Pro
355 360 365

Ile Ser Pro Ser Ala Met Asn Met Gly Ile Lys Gln Ile His Gly Thr
370 375 380

Ala Thr Val Tyr Leu Asp Gly Gly Lys Phe Val Ala Leu Gln Ser Pro
385 390 395 400

Asp Pro Arg Tyr Thr Glu Ser Met Tyr Tyr Ile Asp Pro Gln Gly Val
405 410 415

Arg Tyr Gly Val Pro Asn Ala Glu Thr Ala Lys Ser Leu Gly Leu Ser
420 425 430

Ser Pro Gln Asn Ala Pro Trp Glu Ile Val Arg Leu Leu Val Asp Gly
435 440 445

Pro Val Leu Ser Lys Asp Ala Ala Leu Leu Glu His Asp Thr Leu Pro
450 455 460

Ala Asp Pro Ser Pro Arg Lys Val Pro Ala Gly Ala Ser Gly Ala Pro
465 470 475 480



<210> 60 
<211> 747 
<212> PRT 

<213> Mycobacterium tuberculosis
<220>

<223> Protein sequence Rv3870



<400> 60 

Met Thr Thr Lys Lys Phe Thr Pro Thr Ile Thr Arg Gly Pro Arg Leu
1 5 10 15

Thr Pro Gly Glu Ile Ser Leu Thr Pro Pro Asp Asp Leu Gly Ile Asp
20 25 30

Ile Pro Pro Ser Gly Val Gln Lys Ile Leu Pro Tyr Val Met Gly Gly
35 40 45

Ala Met Leu Gly Met Ile Ala Ile Met Val Ala Gly Gly Thr Arg Gln
50 55 60

Leu Ser Pro Tyr Met Leu Met Met Pro Leu Met Met Ile Val Met Met
65 70 75 80

Val Gly Gly Leu Ala Gly Ser Thr Gly Gly Gly Gly Lys Lys Val Pro
85 90 95

Glu Ile Asn Ala Asp Arg Lys Glu Tyr Leu Arg Tyr Leu Ala Gly Leu
100 105 110

Arg Thr Arg Val Thr Ser Ser Ala Thr Ser Gln Val Ala Phe Phe Ser
115 120 125

Tyr His Ala Pro His Pro Glu Asp Leu Leu Ser Ile Val Gly Thr Gln
130 135 140

Arg Gln Trp Ser Arg Pro Ala Asn Ala Asp Phe Tyr Ala Ala Thr Arg
145 150 155 160

Ile Gly Ile Gly Asp Gln Pro Ala Val Asp Arg Leu Leu Lys Pro Ala
165 170 175

Val Gly Gly Glu Leu Ala Ala Ala Ser Ala Ala Pro Gln Pro Phe Leu
180 185 190

Glu Pro Val Ser His Met Trp Val Val Lys Phe Leu Arg Thr His Gly
195 200 205

Leu Ile His Asp Cys Pro Lys Leu Leu Gln Leu Arg Thr Phe Pro Thr
210 215 220

Ile Ala Ile Gly Gly Asp Leu Ala Gly Ala Ala Gly Leu Met Thr Ala
225 230 235 240

Met Ile Cys His Leu Ala Val Phe His Pro Pro Asp Leu Leu Gln Ile
245 250 255

Arg Val Leu Thr Glu Glu Pro Asp Asp Pro Asp Trp Ser Trp Leu Lys
260 265 270

Trp Leu Pro His Val Gln His Gln Thr Glu Thr Asp Ala Ala Gly Ser
275 280 285

Thr Arg Leu Ile Phe Thr Arg Gln Glu Gly Leu Ser Asp Leu Ala Ala
290 295 300

Arg Gly Pro His Ala Pro Asp Ser Leu Pro Gly Gly Pro Tyr Val Val
305 310 315 320

Val Val Asp Leu Thr Gly Gly Lys Ala Gly Phe Pro Pro Asp Gly Arg
325 330 335

Ala Gly Val Thr Val Ile Thr Leu Gly Asn His Arg Gly Ser Ala Tyr
340 345 350

Arg Ile Arg Val His Glu Asp Gly Thr Ala Asp Asp Arg Leu Pro Asn
355 360 365

Gln Ser Phe Arg Gln Val Thr Ser Val Thr Asp Arg Met Ser Pro Gln
370 375 380

Gln Ala Ser Arg Ile Ala Arg Lys Leu Ala Gly Trp Ser Ile Thr Gly
385 390 395 400

Thr Ile Leu Asp Lys Thr Ser Arg Val Gln Lys Lys Val Ala Thr Asp
405 410 415

Trp His Gln Leu Val Gly Ala Gln Ser Val Glu Glu Ile Thr Pro Ser
420 425 430

Arg Trp Arg Met Tyr Thr Asp Thr Asp Arg Asp Arg Leu Lys Ile Pro
435 440 445

Phe Gly His Glu Leu Lys Thr Gly Asn Val Met Tyr Leu Asp Ile Lys
450 455 460

Glu Gly Ala Glu Phe Gly Ala Gly Pro His Gly Met Leu Ile Gly Thr
465 470 475 480

Thr Gly Ser Gly Lys Ser Glu Phe Leu Arg Thr Leu Ile Leu Ser Leu
485 490 495

Val Ala Met Thr His Pro Asp Glu Val Asn Leu Leu Leu Thr Asp Phe
500 505 510

Lys Gly Gly Ser Thr Phe Leu Gly Met Glu Lys Leu Pro His Thr Ala
515 520 525

Ala Val Val Thr Asn Met Ala Glu Glu Ala Glu Leu Val Ser Arg Met
530 535 540

Gly Glu Val Leu Thr Gly Glu Leu Asp Arg Arg Gln Ser Ile Leu Arg
545 550 555 560

Gln Ala Gly Met Lys Val Gly Ala Ala Gly Ala Leu Ser Gly Val Ala
565 570 575

Glu Tyr Glu Lys Tyr Arg Glu Arg Gly Ala Asp Leu Pro Pro Leu Pro
580 585 590

Thr Leu Phe Val Val Val Asp Glu Phe Ala Glu Leu Leu Gln Ser His
595 600 605

Pro Asp Phe Ile Gly Leu Phe Asp Arg Ile Cys Arg Val Gly Arg Ser
610 615 620

Leu Arg Val His Leu Leu Leu Ala Thr Gln Ser Leu Gln Thr Gly Gly
625 630 635 640

Val Arg Ile Asp Lys Leu Glu Pro Asn Leu Thr Tyr Arg Ile Ala Leu
645 650 655

Arg Thr Thr Ser Ser His Glu Ser Lys Ala Val Ile Gly Thr Pro Glu
660 665 670

Ala Gln Tyr Ile Thr Asn Lys Glu Ser Gly Val Gly Phe Leu Arg Val
675 680 685

Gly Met Glu Asp Pro Val Lys Phe Ser Thr Phe Tyr Ile Ser Gly Pro
690 695 700

Tyr Met Pro Pro Ala Ala Gly Val Glu Thr Asn Gly Glu Ala Gly Gly
705 710 715 720

Pro Gly Gln Gln Thr Thr Arg Gln Ala Ala Arg Ile His Arg Phe Thr
725 730 735

Ala Ala Pro Val Leu Glu Glu Ala Pro Thr Pro
740 745

<210> 61
<211> 591
<212> PRT

<213> Mycobacterium tuberculosis
<220>

<223> Protein sequence Rv3871
<400> 61 

Met Thr Ala Glu Pro Glu Val Arg Thr Leu Arg Glu Val Val Leu Asp
1 5 10 15

Gln Leu Gly Thr Ala Glu Ser Arg Ala Tyr Lys Met Trp Leu Pro Pro
20 25 30

Leu Thr Asn Pro Val Pro Leu Asn Glu Leu Ile Ala Arg Asp Arg Arg
35 40 45

Gln Pro Leu Arg Phe Ala Leu Gly Ile Met Asp Glu Pro Arg Arg His
50 55 60

Leu Gln Asp Val Trp Gly Val Asp Val Ser Gly Ala Gly Gly Asn Ile
65 70 75 80

Gly Ile Gly Gly Ala Pro Gln Thr Gly Lys Ser Thr Leu Leu Gln Thr
85 90 95

Met Val Met Ser Ala Ala Ala Thr His Ser Pro Arg Asn Val Gln Phe
100 105 110

Tyr Cys Ile Asp Leu Gly Gly Gly Gly Leu Ile Tyr Leu Glu Asn Leu
115 120 125

Pro His Val Gly Gly Val Ala Asn Arg Ser Glu Pro Asp Lys Val Asn
130 135 140

Arg Val Val Ala Glu Met Gln Ala Val Met Arg Gln Arg Glu Thr Thr
145 150 155 160

Phe Lys Glu His Arg Val Gly Ser Ile Gly Met Tyr Arg Gln Leu Arg
165 170 175

Asp Asp Pro Ser Gln Pro Val Ala Ser Asp Pro Tyr Gly Asp Val Phe
180 185 190

Leu Ile Ile Asp Gly Trp Pro Gly Phe Val Gly Glu Phe Pro Asp Leu
195 200 205

Glu Gly Gln Val Gln Asp Leu Ala Ala Gln Gly Leu Ala Phe Gly Val
210 215 220

His Val Ile Ile Ser Thr Pro Arg Trp Thr Glu Leu Lys Ser Arg Val
225 230 235 240

Arg Asp Tyr Leu Gly Thr Lys Ile Glu Phe Arg Leu Gly Asp Val Asn
245 250 255

Glu Thr Gln Ile Asp Arg Ile Thr Arg Glu Ile Pro Ala Asn Arg Pro
260 265 270

Gly Arg Ala Val Ser Met Glu Lys His His Leu Met Ile Gly Val Pro
275 280 285

Arg Phe Asp Gly Val His Ser Ala Asp Asn Leu Val Glu Ala Ile Thr
290 295 300

Ala Gly Val Thr Gln Ile Ala Ser Gln His Thr Glu Gln Ala Pro Pro
305 310 315 320

Val Arg Val Leu Pro Glu Arg Ile His Leu His Glu Leu Asp Pro Asn
325 330 335

Pro Pro Gly Pro Glu Ser Asp Tyr Arg Thr Arg Trp Glu Ile Pro Ile
340 345 350

Gly Leu Arg Glu Thr Asp Leu Thr Pro Ala His Cys His Met His Thr
355 360 365

Asn Pro His Leu Leu Ile Phe Gly Ala Ala Lys Ser Gly Lys Thr Thr
370 375 380

Ile Ala His Ala Ile Ala Arg Ala Ile Cys Ala Arg Asn Ser Pro Gln
385 390 395 400

Gln Val Arg Phe Met Leu Ala Asp Tyr Arg Ser Gly Leu Leu Asp Ala
405 410 415

Val Pro Asp Thr His Leu Leu Gly Ala Gly Ala Ile Asn Arg Asn Ser
420 425 430

Ala Ser Leu Asp Glu Ala Val Gln Ala Leu Ala Val Asn Leu Lys Lys
435 440 445

Arg Leu Pro Pro Thr Asp Leu Thr Thr Ala Gln Leu Arg Ser Arg Ser
450 455 460

Trp Trp Ser Gly Phe Asp Val Val Leu Leu Val Asp Asp Trp His Met
465 470 475 480

Ile Val Gly Ala Ala Gly Gly Met Pro Pro Met Ala Pro Leu Ala Pro
485 490 495

Leu Leu Pro Ala Ala Ala Asp Ile Gly Leu His Ile Ile Val Thr Cys
500 505 510

Gln Met Ser Gln Ala Tyr Lys Ala Thr Met Asp Lys Phe Val Gly Ala
515 520 525

Ala Phe Gly Ser Gly Ala Pro Thr Met Phe Leu Ser Gly Glu Lys Gln
530 535 540

Glu Phe Pro Ser Ser Glu Phe Lys Val Lys Arg Arg Pro Pro Gly Gln
545 550 555 560

Ala Phe Leu Val Ser Pro Asp Gly Lys Glu Val Ile Gln Ala Pro Tyr
565 570 575

Ile Glu Pro Pro Glu Glu Val Phe Ala Ala Pro Pro Ser Ala Gly
580 585 590
<210> 62
<211> 99
<212> PRT

<213> Mycobacterium tuberculosis
<220>

<223> Rv3872-PE35 - PE family-related protein
<400> 62

Met Glu Lys Met Ser His Asp Pro Ile Ala Ala Asp Ile Gly Thr Gln
1 5 10 15

Val Ser Asp Asn Ala Leu His Gly Val Thr Ala Gly Ser Thr Ala Leu
20 25 30

Thr Ser Val Thr Gly Leu Val Pro Ala Gly Ala Asp Glu Val Ser Ala
35 40 45

Gln Ala Ala Thr Ala Phe Thr Ser Glu Gly Ile Gln Leu Leu Ala Ser
50 55 60

Asn Ala Ser Ala Gln Asp Gln Leu His Arg Ala Gly Glu Ala Val Gln
65 70 75 80

Asp Val Ala Arg Thr Tyr Ser Gln Ile Asp Asp Gly Ala Ala Gly Val
85 90 95

Phe Ala Glu 



<210> 63 
<211> 368
<212> PRT

<213> Mycobacterium tuberculosis

<220>

<223> Rv3873-PPE68 - PPE family protein
<400> 63

Met Leu Trp His Ala Met Pro Pro Glu Leu Asn Thr Ala Arg Leu Met
1 5 10 15

Ala Gly Ala Gly Pro Ala Pro Met Leu Ala Ala Ala Ala Gly Trp Gln
20 25 30

Thr Leu Ser Ala Ala Leu Asp Ala Gln Ala Val Glu Leu Thr Ala Arg
35 40 45

Leu Asn Ser Leu Gly Glu Ala Trp Thr Gly Gly Gly Ser Asp Lys Ala
50 55 60

Leu Ala Ala Ala Thr Pro Met Val Val Trp Leu Gln Thr Ala Ser Thr
65 70 75 80

Gln Ala Lys Thr Arg Ala Met Gln Ala Thr Ala Gln Ala Ala Ala Tyr
85 90 95

Thr Gln Ala Met Ala Thr Thr Pro Ser Leu Pro Glu Ile Ala Ala Asn
100 105 110

His Ile Thr Gln Ala Val Leu Thr Ala Thr Asn Phe Phe Gly Ile Asn
115 120 125

Thr Ile Pro Ile Ala Leu Thr Glu Met Asp Tyr Phe Ile Arg Met Trp
130 135 140

Asn Gln Ala Ala Leu Ala Met Glu Val Tyr Gln Ala Glu Thr Ala Val
145 150 155 160

Asn Thr Leu Phe Glu Lys Leu Glu Pro Met Ala Ser Ile Leu Asp Pro
165 170 175

Gly Ala Ser Gln Ser Thr Thr Asn Pro Ile Phe Gly Met Pro Ser Pro
180 185 190

Gly Ser Ser Thr Pro Val Gly Gln Leu Pro Pro Ala Ala Thr Gln Thr
195 200 205

Leu Gly Gln Leu Gly Glu Met Ser Gly Pro Met Gln Gln Leu Thr Gln
210 215 220

Pro Leu Gln Gln Val Thr Ser Leu Phe Ser Gln Val Gly Gly Thr Gly
225 230 235 240

Gly Gly Asn Pro Ala Asp Glu Glu Ala Ala Gln Met Gly Leu Leu Gly
245 250 255

Thr Ser Pro Leu Ser Asn His Pro Leu Ala Gly Gly Ser Gly Pro Ser 
260 265 270 

Ala Gly Ala Gly Leu Leu Arg Ala Glu Ser Leu Pro Gly Ala Gly Gly 
275 280 285 

Ser Leu Thr Arg Thr Pro Leu Met Ser Gln Leu Ile Glu Lys Pro Val
290 295 300

Ala Pro Ser Val Met Pro Ala Ala Ala Ala Gly Ser Ser Ala Thr Gly
305 310 315 320

Gly Ala Ala Pro Val Gly Ala Gly Ala Met Gly Gln Gly Ala Gln Ser
325 330 335

Gly Gly Ser Thr Arg Pro Gly Leu Val Ala Pro Ala Pro Leu Ala Gln
340 345 350

Glu Arg Glu Glu Asp Asp Glu Asp Asp Trp Asp Glu Glu Asp Asp Trp 
355 360 365 



<210> 64 
<211> 100 
<212> PRT 

<213> Mycobacterium tuberculosis

<220>

<223> Rv3874-esxB - 10 kDa culture filtrate antigen
<400> 64

Met Ala Glu Met Lys Thr Asp Ala Ala Thr Leu Ala Gln Glu Ala Gly
1 5 10 15

Asn Phe Glu Arg Ile Ser Gly Asp Leu Lys Thr Gln Ile Asp Gln Val
20 25 30

Glu Ser Thr Ala Gly Ser Leu Gln Gly Gln Trp Arg Gly Ala Ala Gly
35 40 45

Thr Ala Ala Gln Ala Ala Val Val Arg Phe Gln Glu Ala Ala Asn Lys
50 55 60

Gln Lys Gln Glu Leu Asp Glu Ile Ser Thr Asn Ile Arg Gln Ala Gly
65 70 75 80

Val Gln Tyr Ser Arg Ala Asp Glu Glu Gln Gln Gln Ala Leu Ser Ser
85 90 95

Gln Met Gly Phe
100



<210> 65 
<211> 95 
<212> PRT 

<213> Mycobacterium tuberculosis 

<220>

<223> Rv3875-Esat6 - 6 kDa early secretory antigenic 
target Esat6 (Esat-6) 

<400> 65 

Met Thr Glu Gln Gln Trp Asn Phe Ala Gly Ile Glu Ala Ala Ala Ser
1 5 10 15

Ala Ile Gln Gly Asn Val Thr Ser Ile His Ser Leu Leu Asp Glu Gly
20 25 30

Lys Gln Ser Leu Thr Lys Leu Ala Ala Ala Trp Gly Gly Ser Gly Ser
35 40 45

Glu Ala Tyr Gln Gly Val Gln Gln Lys Trp Asp Ala Thr Ala Thr Glu
50 55 60

Leu Asn Asn Ala Leu Gln Asn Leu Ala Arg Thr Ile Ser Glu Ala Gly
65 70 75 80

Gln Ala Met Ala Ser Thr Glu Gly Asn Val Thr Gly Met Phe Ala
85 90 95



<210> 66 
<211> 666 
<212> PRT 

<213> Mycobacterium tuberculosis
<220> 

<223> Protein sequence Rv3876 



<400> 66 






Met Ala Ala Asp Tyr Asp Lys Leu Phe Arg Pro His Glu Gly Met Glu
1 5 10 15

Ala Pro Asp Asp Met Ala Ala Gln Pro Phe Phe Asp Pro Ser Ala Ser
20 25 30

Phe Pro Pro Ala Pro Ala Ser Ala Asn Leu Pro Lys Pro Asn Gly Gln
35 40 45

Thr Pro Pro Pro Thr Ser Asp Asp Leu Ser Glu Arg Phe Val Ser Ala
50 55 60

Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Pro Thr Pro Met
65 70 75 80

Pro Ile Ala Ala Gly Glu Pro Pro Ser Pro Glu Pro Ala Ala Ser Lys
85 90 95

Pro Pro Thr Pro Pro Met Pro Ile Ala Gly Pro Glu Pro Ala Pro Pro
100 105 110

Lys Pro Pro Thr Pro Pro Met Pro Ile Ala Gly Pro Glu Pro Ala Pro
115 120 125

Pro Lys Pro Pro Thr Pro Pro Met Pro Ile Ala Gly Pro Ala Pro Thr
130 135 140

Pro Thr Glu Ser Gln Leu Ala Pro Pro Arg Pro Pro Thr Pro Gln Thr
145 150 155 160

Pro Thr Gly Ala Pro Gln Gln Pro Glu Ser Pro Ala Pro His Val Pro
165 170 175

Ser His Gly Pro His Gln Pro Arg Arg Thr Ala Pro Ala Pro Pro Trp
180 185 190

Ala Lys Met Pro Ile Gly Glu Pro Pro Pro Ala Pro Ser Arg Pro Ser
195 200 205

Ala Ser Pro Ala Glu Pro Pro Thr Arg Pro Ala Pro Gln His Ser Arg
210 215 220

Arg Ala Arg Arg Gly His Arg Tyr Arg Thr Asp Thr Glu Arg Asn Val
225 230 235 240

Gly Lys Val Ala Thr Gly Pro Ser Ile Gln Ala Arg Leu Arg Ala Glu
245 250 255

Glu Ala Ser Gly Ala Gln Leu Ala Pro Gly Thr Glu Pro Ser Pro Ala
260 265 270

Pro Leu Gly Gln Pro Arg Ser Tyr Leu Ala Pro Pro Thr Arg Pro Ala
275 280 285

Pro Thr Glu Pro Pro Pro Ser Pro Ser Pro Gln Arg Asn Ser Gly Arg
290 295 300

Arg Ala Glu Arg Arg Val His Pro Asp Leu Ala Ala Gln His Ala Ala
305 310 315 320
Ala Gln Pro Asp Ser Ile Thr Ala Ala Thr Thr Gly Gly Arg Arg Arg
325 330 335

Lys Arg Ala Ala Pro Asp Leu Asp Ala Thr Gln Lys Ser Leu Arg Pro
340 345 350

Ala Ala Lys Gly Pro Lys Val Lys Lys Val Lys Pro Gln Lys Pro Lys
355 360 365

Ala Thr Lys Pro Pro Lys Val Val Ser Gln Arg Gly Trp Arg His Trp
370 375 380

Val His Ala Leu Thr Arg Ile Asn Leu Gly Leu Ser Pro Asp Glu Lys
385 390 395 400

Tyr Glu Leu Asp Leu His Ala Arg Val Arg Arg Asn Pro Arg Gly Ser 
405 410 415 

Tyr Gln Ile Ala Val Val Gly Leu Lys Gly Gly Ala Gly Lys Thr Thr
420 425 430

Leu Thr Ala Ala Leu Gly Ser Thr Leu Ala Gln Val Arg Ala Asp Arg
435 440 445

Ile Leu Ala Leu Asp Ala Asp Pro Gly Ala Gly Asn Leu Ala Asp Arg
450 455 460

Val Gly Arg Gln Ser Gly Ala Thr Ile Ala Asp Val Leu Ala Glu Lys
465 470 475 480

Glu Leu Ser His Tyr Asn Asp Ile Arg Ala His Thr Ser Val Asn Ala
485 490 495

Val Asn Leu Glu Val Leu Pro Ala Pro Glu Tyr Ser Ser Ala Gln Arg
500 505 510

Ala Leu Ser Asp Ala Asp Trp His Phe Ile Ala Asp Pro Ala Ser Arg
515 520 525

Phe Tyr Asn Leu Val Leu Ala Asp Cys Gly Ala Gly Phe Phe Asp Pro 
530 535 540 

Leu Thr Arg Gly Val Leu Ser Thr Val Ser Gly Val Val Val Val Ala 
545 550 555 560 

Ser Val Ser Ile Asp Gly Ala Gln Gln Ala Ser Val Ala Leu Asp Trp
565 570 575

Leu Arg Asn Asn Gly Tyr Gln Asp Leu Ala Ser Arg Ala Cys Val Val
580 585 590

Ile Asn His Ile Met Pro Gly Glu Pro Asn Val Ala Val Lys Asp Leu
595 600 605

Val Arg His Phe Glu Gln Gln Val Gln Pro Gly Arg Val Val Val Met
610 615 620

Pro Trp Asp Arg His Ile Ala Ala Gly Thr Glu Ile Ser Leu Asp Leu
625 630 635 640

Leu Asp Pro Ile Tyr Lys Arg Lys Val Leu Glu Leu Ala Ala Ala Leu
645 650 655



Ser Asp Asp Phe Glu Arg Ala Gly Arg Arg 
660 665 



<210> 67 
<211> 511 
<212> PRT 

<213> Mycobacterium tuberculosis
<220>

<223> Protein sequence Rv3877
<400> 67

Met Ser Ala Pro Ala Val Ala Ala Gly Pro Thr Ala Ala Gly Ala Thr
1 5 10 15

Ala Ala Arg Pro Ala Thr Thr Arg Val Thr Ile Leu Thr Gly Arg Arg
20 25 30

Met Thr Asp Leu Val Leu Pro Ala Ala Val Pro Met Glu Thr Tyr Ile
35 40 45

Asp Asp Thr Val Ala Val Leu Ser Glu Val Leu Glu Asp Thr Pro Ala
50 55 60

Asp Val Leu Gly Gly Phe Asp Phe Thr Ala Gln Gly Val Trp Ala Phe
65 70 75 80

Ala Arg Pro Gly Ser Pro Pro Leu Lys Leu Asp Gln Ser Leu Asp Asp
85 90 95

Ala Gly Val Val Asp Gly Ser Leu Leu Thr Leu Val Ser Val Ser Arg
100 105 110

Thr Glu Arg Tyr Arg Pro Leu Val Glu Asp Val Ile Asp Ala Ile Ala
115 120 125

Val Leu Asp Glu Ser Pro Glu Phe Asp Arg Thr Ala Leu Asn Arg Phe
130 135 140

Val Gly Ala Ala Ile Pro Leu Leu Thr Ala Pro Val Ile Gly Met Ala
145 150 155 160

Met Arg Ala Trp Trp Glu Thr Gly Arg Ser Leu Trp Trp Pro Leu Ala
165 170 175

Ile Gly Ile Leu Gly Ile Ala Val Leu Val Gly Ser Phe Val Ala Asn
180 185 190

Arg Phe Tyr Gln Ser Gly His Leu Ala Glu Cys Leu Leu Val Thr Thr
195 200 205

Tyr Leu Leu Ile Ala Thr Ala Ala Ala Leu Ala Val Pro Leu Pro Arg
210 215 220

Gly Val Asn Ser Leu Gly Ala Pro Gln Val Ala Gly Ala Ala Thr Ala
225 230 235 240
Val Leu Phe Leu Thr Leu Met Thr Arg Gly Gly Pro Arg Lys Arg His
245 250 255

Glu Leu Ala Ser Phe Ala Val Ile Thr Ala Ile Ala Val Ile Ala Ala
260 265 270

Ala Ala Ala Phe Gly Tyr Gly Tyr Gln Asp Trp Val Pro Ala Gly Gly
275 280 285

Ile Ala Phe Gly Leu Phe Ile Val Thr Asn Ala Ala Lys Leu Thr Val
290 295 300

Ala Val Ala Arg Ile Ala Leu Pro Pro Ile Pro Val Pro Gly Glu Thr
305 310 315 320

Val Asp Asn Glu Glu Leu Leu Asp Pro Val Ala Thr Pro Glu Ala Thr
325 330 335

Ser Glu Glu Thr Pro Thr Trp Gln Ala Ile Ile Ala Ser Val Pro Ala
340 345 350

Ser Ala Val Arg Leu Thr Glu Arg Ser Lys Leu Ala Lys Gln Leu Leu
355 360 365

Ile Gly Tyr Val Thr Ser Gly Thr Leu Ile Leu Ala Ala Gly Ala Ile
370 375 380

Ala Val Val Val Arg Gly His Phe Phe Val His Ser Leu Val Val Ala 
385 390 395 400 

Gly Leu Ile Thr Thr Val Cys Gly Phe Arg Ser Arg Leu Tyr Ala Glu
405 410 415

Arg Trp Cys Ala Trp Ala Leu Leu Ala Ala Thr Val Ala Ile Pro Thr
420 425 430

Gly Leu Thr Ala Lys Leu Ile Ile Trp Tyr Pro His Tyr Ala Trp Leu
435 440 445

Leu Leu Ser Val Tyr Leu Thr Val Ala Leu Val Ala Leu Val Val Val 
450 455 460 

Gly Ser Met Ala His Val Arg Arg Val Ser Pro Val Val Lys Arg Thr
465 470 475 480

Leu Glu Leu Ile Asp Gly Ala Met Ile Ala Ala Ile Ile Pro Met Leu
485 490 495

Leu Trp Ile Thr Gly Val Tyr Asp Thr Val Arg Asn Ile Arg Phe
500 505 510 



<210> 68 
<211> 280 
<212> PRT 

<213> Mycobacterium tuberculosis
<220>

<223> Rv3878 - conserved hypothetical alanine rich
protein
<400> 68

Met Ala Glu Pro Leu Ala Val Asp Pro Thr Gly Leu Ser Ala Ala Ala 
1 5 10 15 

Ala Lys Leu Ala Gly Leu Val Phe Pro Gln Pro Pro Ala Pro Ile Ala
20 25 30

Val Ser Gly Thr Asp Ser Val Val Ala Ala Ile Asn Glu Thr Met Pro
35 40 45

Ser Ile Glu Ser Leu Val Ser Asp Gly Leu Pro Gly Val Lys Ala Ala
50 55 60

Leu Thr Arg Thr Ala Ser Asn Met Asn Ala Ala Ala Asp Val Tyr Ala 
65 70 75 80 

Lys Thr Asp Gln Ser Leu Gly Thr Ser Leu Ser Gln Tyr Ala Phe Gly
85 90 95

Ser Ser Gly Glu Gly Leu Ala Gly Val Ala Ser Val Gly Gly Gln Pro
100 105 110

Ser Gln Ala Thr Gln Leu Leu Ser Thr Pro Val Ser Gln Val Thr Thr
115 120 125

Gln Leu Gly Glu Thr Ala Ala Glu Leu Ala Pro Arg Val Val Ala Thr
130 135 140

Val Pro Gln Leu Val Gln Leu Ala Pro His Ala Val Gln Met Ser Gln
145 150 155 160

Asn Ala Ser Pro Ile Ala Gln Thr Ile Ser Gln Thr Ala Gln Gln Ala
165 170 175

Ala Gln Ser Ala Gln Gly Gly Ser Gly Pro Met Pro Ala Gln Leu Ala
180 185 190

Ser Ala Glu Lys Pro Ala Thr Glu Gln Ala Glu Pro Val His Glu Val
195 200 205

Thr Asn Asp Asp Gln Gly Asp Gln Gly Asp Val Gln Pro Ala Glu Val
210 215 220

Val Ala Ala Ala Arg Asp Glu Gly Ala Gly Ala Ser Pro Gly Gln Gln
225 230 235 240

Pro Gly Gly Gly Val Pro Ala Gln Ala Met Asp Thr Gly Ala Gly Ala
245 250 255

Arg Pro Ala Ala Ser Pro Leu Ala Ala Pro Val Asp Pro Ser Thr Pro 
260 265 270 

Ala Pro Ser Thr Thr Thr Thr Leu 
275 280 



<210> 69 
<211> 729 
<212> PRT 

<213> Mycobacterium tuberculosis 
<220>

<223> Rv3879c - hypothetical alanine and proline rich 
protein 

<400> 69 

Met Ser Ile Thr Arg Pro Thr Gly Ser Tyr Ala Arg Gln Met Leu Asp
1 5 10 15

Pro Gly Gly Trp Val Glu Ala Asp Glu Asp Thr Phe Tyr Asp Arg Ala
20 25 30

Gln Glu Tyr Ser Gln Val Leu Gln Arg Val Thr Asp Val Leu Asp Thr
35 40 45

Cys Arg Gln Gln Lys Gly His Val Phe Glu Gly Gly Leu Trp Ser Gly
50 55 60

Gly Ala Ala Asn Ala Ala Asn Gly Ala Leu Gly Ala Asn Ile Asn Gln
65 70 75 80

Leu Met Thr Leu Gln Asp Tyr Leu Ala Thr Val Ile Thr Trp His Arg
85 90 95

His Ile Ala Gly Leu Ile Glu Gln Ala Lys Ser Asp Ile Gly Asn Asn
100 105 110

Val Asp Gly Ala Gln Arg Glu Ile Asp Ile Leu Glu Asn Asp Pro Ser
115 120 125

Leu Asp Ala Asp Glu Arg His Thr Ala Ile Asn Ser Leu Val Thr Ala
130 135 140

Thr His Gly Ala Asn Val Ser Leu Val Ala Glu Thr Ala Glu Arg Val
145 150 155 160

Leu Glu Ser Lys Asn Trp Lys Pro Pro Lys Asn Ala Leu Glu Asp Leu
165 170 175

Leu Gln Gln Lys Ser Pro Pro Pro Pro Asp Val Pro Thr Leu Val Val
180 185 190

Pro Ser Pro Gly Thr Pro Gly Thr Pro Gly Thr Pro Ile Thr Pro Gly
195 200 205

Thr Pro Ile Thr Pro Gly Thr Pro Ile Thr Pro Ile Pro Gly Ala Pro
210 215 220

Val Thr Pro Ile Thr Pro Thr Pro Gly Thr Pro Val Thr Pro Val Thr
225 230 235 240

Pro Gly Lys Pro Val Thr Pro Val Thr Pro Val Lys Pro Gly Thr Pro 
245 250 255 

Gly Glu Pro Thr Pro Ile Thr Pro Val Thr Pro Pro Val Ala Pro Ala
260 265 270

Thr Pro Ala Thr Pro Ala Thr Pro Val Thr Pro Ala Pro Ala Pro His
275 280 285

Pro Gln Pro Ala Pro Ala Pro Ala Pro Ser Pro Gly Pro Gln Pro Val
290 295 300

Thr Pro Ala Thr Pro Gly Pro Ser Gly Pro Ala Thr Pro Gly Thr Pro
305 310 315 320

Gly Gly Glu Pro Ala Pro His Val Lys Pro Ala Ala Leu Ala Glu Gln
325 330 335

Pro Gly Val Pro Gly Gln His Ala Gly Gly Gly Thr Gln Ser Gly Pro
340 345 350

Ala His Ala Asp Glu Ser Ala Ala Ser Val Thr Pro Ala Ala Ala Ser 
355 360 365 

Gly Val Pro Gly Ala Arg Ala Ala Ala Ala Ala Pro Ser Gly Thr Ala 
370 375 380 

Val Gly Ala Gly Ala Arg Ser Ser Val Gly Thr Ala Ala Ala Ser Gly 
385 390 395 400 

Ala Gly Ser His Ala Ala Thr Gly Arg Ala Pro Val Ala Thr Ser Asp 
405 410 415 

Lys Ala Ala Ala Pro Ser Thr Arg Ala Ala Ser Ala Arg Thr Ala Pro 
420 425 430 

Pro Ala Arg Pro Pro Ser Thr Asp His Ile Asp Lys Pro Asp Arg Ser
435 440 445

Glu Ser Ala Asp Asp Gly Thr Pro Val Ser Met Ile Pro Val Ser Ala
450 455 460

Ala Arg Ala Ala Arg Asp Ala Ala Thr Ala Ala Ala Ser Ala Arg Gln
465 470 475 480

Arg Gly Arg Gly Asp Ala Leu Arg Leu Ala Arg Arg Ile Ala Ala Ala
485 490 495

Leu Asn Ala Ser Asp Asn Asn Ala Gly Asp Tyr Gly Phe Phe Trp Ile
500 505 510

Thr Ala Val Thr Thr Asp Gly Ser Ile Val Val Ala Asn Ser Tyr Gly
515 520 525

Leu Ala Tyr Ile Pro Asp Gly Met Glu Leu Pro Asn Lys Val Tyr Leu
530 535 540

Ala Ser Ala Asp His Ala Ile Pro Val Asp Glu Ile Ala Arg Cys Ala
545 550 555 560

Thr Tyr Pro Val Leu Ala Val Gln Ala Trp Ala Ala Phe His Asp Met
565 570 575

Thr Leu Arg Ala Val Ile Gly Thr Ala Glu Gln Leu Ala Ser Ser Asp
580 585 590

Pro Gly Val Ala Lys Ile Val Leu Glu Pro Asp Asp Ile Pro Glu Ser
595 600 605

Gly Lys Met Thr Gly Arg Ser Arg Leu Glu Val Val Asp Pro Ser Ala
610 615 620
Ala Ala Gln Leu Ala Asp Thr Thr Asp Gln Arg Leu Leu Asp Leu Leu
625 630 635 640 

Pro Pro Ala Pro Val Asp Val Asn Pro Pro Gly Asp Glu Arg His Met
645 650 655

Leu Trp Phe Glu Leu Met Lys Pro Met Thr Ser Thr Ala Thr Gly Arg
660 665 670

Glu Ala Ala His Leu Arg Ala Phe Arg Ala Tyr Ala Ala His Ser Gln
675 680 685

Glu Ile Ala Leu His Gln Ala His Thr Ala Thr Asp Ala Ala Val Gln
690 695 700

Arg Val Ala Val Ala Asp Trp Leu Tyr Trp Gln Tyr Val Thr Gly Leu
705 710 715 720 

Leu Asp Arg Ala Leu Ala Ala Ala Cys 
725 



<210> 70 
<211> 115
<212> PRT

<213> Mycobacterium tuberculosis
<220>

<223> Rv3880c - conserved hypothetical protein
<400> 70 

Val Ser Met Asp Glu Leu Asp Pro His Val Ala Arg Ala Leu Thr Leu
1 5 10 15

Ala Ala Arg Phe Gln Ser Ala Leu Asp Gly Thr Leu Asn Gln Met Asn
20 25 30

Asn Gly Ser Phe Arg Ala Thr Asp Glu Ala Glu Thr Val Glu Val Thr
35 40 45

Ile Asn Gly His Gln Trp Leu Thr Gly Leu Arg Ile Glu Asp Gly Leu
50 55 60

Leu Lys Lys Leu Gly Ala Glu Ala Val Ala Gln Arg Val Asn Glu Ala
65 70 75 80

Leu His Asn Ala Gln Ala Ala Ala Ser Ala Tyr Asn Asp Ala Ala Gly
85 90 95

Glu Gln Leu Thr Ala Ala Leu Ser Ala Met Ser Arg Ala Met Asn Glu
100 105 110 

Gly Met Ala 
115 



<210> 71
<211> 460
<212> PRT

<213> Mycobacterium tuberculosis
<220>

<223> Rv3881c - conserved hypothetical alanine and
glycine rich protein

<400> 71 

Met Thr Gln Ser Gln Thr Val Thr Val Asp Gln Gln Glu Ile Leu Asn
1 5 10 15

Arg Ala Asn Glu Val Glu Ala Pro Met Ala Asp Pro Pro Thr Asp Val
20 25 30

Pro Ile Thr Pro Cys Glu Leu Thr Ala Ala Lys Asn Ala Ala Gln Gln
35 40 45

Leu Val Leu Ser Ala Asp Asn Met Arg Glu Tyr Leu Ala Ala Gly Ala
50 55 60

Lys Glu Arg Gln Arg Leu Ala Thr Ser Leu Arg Asn Ala Ala Lys Ala
65 70 75 80

Tyr Gly Glu Val Asp Glu Glu Ala Ala Thr Ala Leu Asp Asn Asp Gly 
85 90 95 

Glu Gly Thr Val Gln Ala Glu Ser Ala Gly Ala Val Gly Gly Asp Ser
100 105 110

Ser Ala Glu Leu Thr Asp Thr Pro Arg Val Ala Thr Ala Gly Glu Pro
115 120 125

Asn Phe Met Asp Leu Lys Glu Ala Ala Arg Lys Leu Glu Thr Gly Asp
130 135 140

Gln Gly Ala Ser Leu Ala His Phe Ala Asp Gly Trp Asn Thr Phe Asn
145 150 155 160

Leu Thr Leu Gln Gly Asp Val Lys Arg Phe Arg Gly Phe Asp Asn Trp
165 170 175

Glu Gly Asp Ala Ala Thr Ala Cys Glu Ala Ser Leu Asp Gln Gln Arg
180 185 190

Gln Trp Ile Leu His Met Ala Lys Leu Ser Ala Ala Met Ala Lys Gln
195 200 205

Ala Gln Tyr Val Ala Gln Leu His Val Trp Ala Arg Arg Glu His Pro
210 215 220

Thr Tyr Glu Asp Ile Val Gly Leu Glu Arg Leu Tyr Ala Glu Asn Pro
225 230 235 240

Ser Ala Arg Asp Gln Ile Leu Pro Val Tyr Ala Glu Tyr Gln Gln Arg
245 250 255

Ser Glu Lys Val Leu Thr Glu Tyr Asn Asn Lys Ala Ala Leu Glu Pro 
260 265 270 

Val Asn Pro Pro Lys Pro Pro Pro Ala Ile Lys Ile Asp Pro Pro Pro
275 280 285

Pro Pro Gln Glu Gln Gly Leu Ile Pro Gly Phe Leu Met Pro Pro Ser
290 295 300

Asp Gly Ser Gly Val Thr Pro Gly Thr Gly Met Pro Ala Ala Pro Met
305 310 315 320

Val Pro Pro Thr Gly Ser Pro Gly Gly Gly Leu Pro Ala Asp Thr Ala
325 330 335

Ala Gln Leu Thr Ser Ala Gly Arg Glu Ala Ala Ala Leu Ser Gly Asp
340 345 350

Val Ala Val Lys Ala Ala Ser Leu Gly Gly Gly Gly Gly Gly Gly Val 
355 360 365 

Pro Ser Ala Pro Leu Gly Ser Ala Ile Gly Gly Ala Glu Ser Val Arg
370 375 380

Pro Ala Gly Ala Gly Asp Ile Ala Gly Leu Gly Gln Gly Arg Ala Gly
385 390 395 400 

Gly Gly Ala Ala Leu Gly Gly Gly Gly Met Gly Met Pro Met Gly Ala 
405 410 415 

Ala His Gln Gly Gln Gly Gly Ala Lys Ser Lys Gly Ser Gln Gln Glu
420 425 430

Asp Glu Ala Leu Tyr Thr Glu Asp Arg Ala Trp Thr Glu Ala Val Ile
435 440 445

Gly Asn Arg Arg Arg Gln Asp Ser Lys Glu Ser Lys
450 455 460 



<210> 72 
<211> 462 
<212> PRT 

<213> Mycobacterium tuberculosis 

<220>
<223> Rv3882c - possible conserved membrane protein

<400> 72

Met Arg Asn Pro Leu Gly Leu Arg Phe Ser Thr Gly His Ala Leu Leu 
1 5 10 15

Ala Ser Ala Leu Ala Pro Pro Cys Ile Ile Ala Phe Leu Glu Thr Arg
20 25 30

Tyr Trp Trp Ala Gly Ile Ala Leu Ala Ser Leu Gly Val Ile Val Ala
35 40 45

Thr Val Thr Phe Tyr Gly Arg Arg Ile Thr Gly Trp Val Ala Ala Val
50 55 60

Tyr Ala Trp Leu Arg Arg Arg Arg Arg Pro Pro Asp Ser Ser Ser Glu
65 70 75 80

Pro Val Val Gly Ala Thr Val Lys Pro Gly Asp His Val Ala Val Arg
85 90 95

Trp Gln Gly Glu Phe Leu Val Ala Val Ile Glu Leu Ile Pro Arg Pro
100 105 110

Phe Thr Pro Thr Val Ile Val Asp Gly Gln Ala His Thr Asp Asp Met
115 120 125

Leu Asp Thr Gly Leu Val Glu Glu Leu Leu Ser Val His Cys Pro Asp 
130 135 140 

Leu Glu Ala Asp Ile Val Ser Ala Gly Tyr Arg Val Gly Asn Thr Ala
145 150 155 160

Ala Pro Asp Val Val Ser Leu Tyr Gln Gln Val Ile Gly Thr Asp Pro
165 170 175

Ala Pro Ala Asn Arg Arg Thr Trp Ile Val Leu Arg Ala Asp Pro Glu
180 185 190

Arg Thr Arg Lys Ser Ala Gln Arg Arg Asp Glu Gly Val Ala Gly Leu
195 200 205

Ala Arg Tyr Leu Val Ala Ser Ala Thr Arg Ile Ala Asp Arg Leu Ala
210 215 220

Ser His Gly Val Asp Ala Val Cys Gly Arg Ser Phe Asp Asp Tyr Asp 
225 230 235 240 

His Ala Thr Asp Ile Gly Phe Val Arg Glu Lys Trp Ser Met Ile Lys
245 250 255

Gly Arg Asp Ala Tyr Thr Ala Ala Tyr Ala Ala Pro Gly Gly Pro Asp
260 265 270

Val Trp Trp Ser Ala Arg Ala Asp His Thr Ile Thr Arg Val Arg Val
275 280 285

Ala Pro Gly Met Ala Pro Gln Ser Thr Val Leu Leu Thr Thr Ala Asp
290 295 300

Lys Pro Lys Thr Pro Arg Gly Phe Ala Arg Leu Phe Gly Gly Gln Arg
305 310 315 320

Pro Ala Leu Gln Gly Gln His Leu Val Ala Asn Arg His Cys Gln Leu
325 330 335

Pro Ile Gly Ser Ala Gly Val Leu Val Gly Glu Thr Val Asn Arg Cys
340 345 350

Pro Val Tyr Met Pro Phe Asp Asp Val Asp Ile Ala Leu Asn Leu Gly
355 360 365

Asp Ala Gln Thr Phe Thr Gln Phe Val Val Arg Ala Ala Ala Ala Gly
370 375 380

Ala Met Val Thr Val Gly Pro Gln Phe Glu Glu Phe Ala Arg Leu Ile
385 390 395 400

Gly Ala His Ile Gly Gln Glu Val Lys Val Ala Trp Pro Asn Ala Thr
405 410 415

Thr Tyr Leu Gly Pro His Pro Gly Ile Asp Arg Val Ile Leu Arg His
420 425 430

Asn Val Ile Gly Thr Pro Arg His Arg Gln Leu Pro Ile Arg Arg Val
435 440 445

Ser Pro Pro Glu Glu Ser Arg Tyr Gln Met Ala Leu Pro Lys
450 455 460 



<210> 73 
<211> 446 
<212> PRT 

<213> Mycobacterium tuberculosis
<220> 

<223> Rv3883c - possible secreted protease 
<400> 73 

Val His Arg Ile Phe Leu Ile Thr Val Ala Leu Ala Leu Leu Thr Ala
1 5 10 15

Ser Pro Ala Ser Ala Ile Thr Pro Pro Pro Ile Asp Pro Gly Ala Leu
20 25 30

Pro Pro Asp Val Thr Gly Pro Asp Gln Pro Thr Glu Gln Arg Val Leu
35 40 45

Cys Ala Ser Pro Thr Thr Leu Pro Gly Ser Gly Phe His Asp Pro Pro 
50 55 60 

Trp Ser Asn Thr Tyr Leu Gly Val Ala Asp Ala His Lys Phe Ala Thr 
65 70 75 80 

Gly Ala Gly Val Thr Val Ala Val Ile Asp Thr Gly Val Asp Ala Ser
85 90 95

Pro Arg Val Pro Ala Glu Pro Gly Gly Asp Phe Val Asp Gln Ala Gly
100 105 110

Asn Gly Leu Ser Asp Cys Asp Ala His Gly Thr Leu Thr Ala Ser Ile
115 120 125

Ile Ala Gly Arg Pro Ala Pro Thr Asp Gly Phe Val Gly Val Ala Pro
130 135 140

Asp Ala Arg Leu Leu Ser Leu Arg Gln Thr Ser Glu Ala Phe Glu Pro
145 150 155 160

Val Gly Ser Gln Ala Asn Pro Asn Asp Pro Asn Ala Thr Pro Ala Ala
165 170 175

Gly Ser Ile Arg Ser Leu Ala Arg Ala Val Val His Ala Ala Asn Leu
180 185 190

Gly Val Gly Val Ile Asn Ile Ser Glu Ala Ala Cys Tyr Lys Val Ser
195 200 205

Arg Pro Ile Asp Glu Thr Ser Leu Gly Ala Ser Ile Asp Tyr Ala Val
210 215 220

Asn Val Lys Gly Val Val Val Val Val Ala Ala Gly Asn Thr Gly Gly
225 230 235 240

Asp Cys Val Gln Asn Pro Ala Pro Asp Pro Ser Thr Pro Gly Asp Pro
245 250 255

Arg Gly Trp Asn Asn Val Gln Thr Val Val Thr Pro Ala Trp Tyr Ala
260 265 270

Pro Leu Val Leu Ser Val Gly Gly Ile Gly Gln Thr Gly Met Pro Ser
275 280 285

Ser Phe Ser Met His Gly Pro Trp Val Asp Val Ala Ala Pro Ala Glu 
290 295 300 

Asn Ile Val Ala Leu Gly Asp Thr Gly Glu Pro Val Asn Ala Leu Gln
305 310 315 320

Gly Arg Glu Gly Pro Val Pro Ile Ala Gly Thr Ser Phe Ala Ala Ala
325 330 335

Tyr Val Ser Gly Leu Ala Ala Leu Leu Arg Gln Arg Phe Pro Asp Leu
340 345 350

Thr Pro Ala Gln Ile Ile His Arg Ile Thr Ala Thr Ala Arg His Pro
355 360 365

Gly Gly Gly Val Asp Asp Leu Val Gly Ala Gly Val Ile Asp Ala Val
370 375 380

Ala Ala Leu Thr Trp Asp Ile Pro Pro Gly Pro Ala Ser Ala Pro Tyr
385 390 395 400

Asn Val Arg Arg Leu Pro Pro Pro Val Val Glu Pro Gly Pro Asp Arg
405 410 415

Arg Pro Ile Thr Ala Val Ala Leu Val Ala Val Gly Leu Thr Leu Ala
420 425 430

Leu Gly Leu Gly Ala Leu Ala Arg Arg Ala Leu Ser Arg Arg 
435 440 445 



<210> 74 
<211> 619 
<212> PRT 

<213> Mycobacterium tuberculosis 

<220> 

<223> Rv3884c - probable CBXX/CFQX family protein

<400> 74

Met Ser Arg Met Val Asp Thr Met Gly Asp Leu Leu Thr Ala Arg Arg 
1 5 10 15 

His Phe Asp Arg Ala Met Thr Ile Lys Asn Gly Gln Gly Cys Val Ala
20 25 30

Ala Leu Pro Glu Phe Val Ala Ala Thr Glu Ala Asp Pro Ser Met Ala 
35 40 45 

Asp Ala Trp Leu Gly Arg Ile Ala Cys Gly Asp Arg Asp Leu Ala Ser
50 55 60

Leu Lys Gln Leu Asn Ala His Ser Glu Trp Leu His Arg Glu Thr Thr
65 70 75 80

Arg Ile Gly Arg Thr Leu Ala Ala Glu Val Gln Leu Gly Pro Ser Ile
85 90 95

Gly Ile Thr Val Thr Asp Ala Ser Gln Val Gly Leu Ala Leu Ser Ser
100 105 110

Ala Leu Thr Ile Ala Gly Glu Tyr Ala Lys Ala Asp Ala Leu Leu Ala
115 120 125

Asn Arg Glu Leu Leu Asp Ser Trp Arg Asn Tyr Gln Trp His Gln Leu
130 135 140

Ala Arg Ala Phe Leu Met Tyr Val Thr Gln Arg Trp Pro Asp Val Leu
145 150 155 160

Ser Thr Ala Ala Glu Asp Leu Pro Pro Gln Ala Ile Val Met Pro Ala
165 170 175

Val Thr Ala Ser Ile Cys Ala Leu Ala Ala His Ala Ala Ala His Leu
180 185 190

Gly Gln Gly Arg Val Ala Leu Asp Trp Leu Asp Arg Val Asp Val Ile
195 200 205

Gly His Ser Arg Ser Ser Glu Arg Phe Gly Ala Asp Val Leu Thr Ala
210 215 220

Ala Ile Gly Pro Ala Asp Ile Pro Leu Leu Val Ala Asp Leu Ala Tyr
225 230 235 240

Val Arg Gly Met Val Tyr Arg Gln Leu His Glu Glu Asp Lys Ala Gln
245 250 255

Ile Trp Leu Ser Lys Ala Thr Ile Asn Gly Val Leu Thr Asp Ala Ala
260 265 270

Lys Glu Ala Leu Ala Asp Pro Asn Leu Arg Leu Ile Val Thr Asp Glu
275 280 285

Arg Thr Ile Ala Ser Arg Ser Asp Arg Trp Asp Ala Ser Thr Ala Lys
290 295 300

Ser Arg Asp Gln Leu Asp Asp Asp Asn Ala Ala Gln Arg Arg Gly Glu
305 310 315 320

Leu Leu Ala Glu Gly Arg Glu Leu Leu Ala Lys Gln Val Gly Leu Ala
325 330 335

Ala Val Lys Gln Ala Val Ser Ala Leu Glu Asp Gln Leu Glu Val Arg
340 345 350

Met Met Arg Leu Glu His Gly Leu Pro Val Glu Gly Gln Thr Asn His
355 360 365

Met Leu Leu Val Gly Pro Pro Gly Thr Gly Lys Thr Thr Thr Ala Glu 
370 375 380

Ala Leu Gly Lys Ile Tyr Ala Gly Met Gly Ile Val Arg His Pro Glu
385 390 395 400

Ile Arg Glu Val Arg Arg Ser Asp Phe Cys Gly His Tyr Ile Gly Glu
405 410 415

Ser Gly Pro Lys Thr Asn Glu Leu Ile Glu Lys Ser Leu Gly Arg Ile
420 425 430

Ile Phe Met Asp Glu Phe Tyr Ser Leu Ile Glu Arg His Gln Asp Gly
435 440 445

Thr Pro Asp Met Ile Gly Met Glu Ala Val Asn Gln Leu Leu Val Gln
450 455 460

Leu Glu Thr His Arg Phe Asp Phe Cys Phe Ile Gly Ala Gly Tyr Glu
465 470 475 480

Asp Gln Val Asp Glu Phe Leu Thr Val Asn Pro Gly Leu Ala Gly Arg
485 490 495

Phe Asn Arg Lys Leu Arg Phe Glu Ser Tyr Ser Pro Val Glu Ile Val
500 505 510

Glu Ile Gly His Arg Tyr Ala Thr Pro Arg Ala Ser Gln Leu Asp Asp
515 520 525

Ala Ala Arg Glu Val Phe Leu Asp Ala Val Thr Thr Ile Arg Asn Tyr
530 535 540

Thr Thr Pro Ser Gly Gln His Gly Ile Asp Ala Met Gln Asn Gly Arg
545 550 555 560

Phe Ala Arg Asn Val Ile Glu Arg Ala Glu Gly Phe Arg Asp Thr Arg
565 570 575

Val Val Ala Gln Lys Arg Ala Gly Gln Pro Val Ser Val Gln Asp Leu
580 585 590

Gln Ile Ile Thr Ala Thr Asp Ile Asp Ala Ala Ile Arg Ser Val Cys
595 600 605

Ser Asp Asn Arg Asp Met Ala Ala Ile Val Trp
610 615 



<210> 75 
<211> 537
<212> PRT 

<213> Mycobacterium tuberculosis 
<220> 

<223> Rv3885c - possible conserved membrane protein 
<400> 75 

Leu Thr Ser Lys Leu Thr Gly Phe Ser Pro Arg Ser Ala Arg Arg Val 
1 5 10 15

Ala Gly Val Trp Thr Val Phe Val Leu Ala Ser Ala Gly Trp Ala Leu 
20 25 30 

Gly Gly Gln Leu Gly Ala Val Met Ala Val Val Val Gly Val Ala Leu
35 40 45

Val Phe Val Gln Trp Trp Gly Gln Pro Ala Trp Ser Trp Ala Val Leu
50 55 60

Gly Leu Arg Gly Arg Arg Pro Val Lys Trp Asn Asp Pro Ile Thr Leu
65 70 75 80

Ala Asn Asn Arg Ser Gly Gly Gly Val Arg Val Gln Asp Gly Val Ala
85 90 95

Val Val Ala Val Gln Leu Leu Gly Arg Ala His Arg Ala Thr Thr Val
100 105 110

Thr Gly Ser Val Thr Val Glu Ser Asp Asn Val Ile Asp Val Val Glu
115 120 125

Leu Ala Pro Leu Leu Arg His Pro Leu Asp Leu Glu Leu Asp Ser Ile
130 135 140

Ser Val Val Thr Phe Gly Ser Arg Thr Gly Thr Val Gly Asp Tyr Pro 
145 150 155 160 

Arg Val Tyr Asp Ala Glu Ile Gly Thr Pro Pro Tyr Ala Gly Arg Arg
165 170 175

Glu Thr Trp Leu Ile Met Arg Leu Pro Val Ile Gly Asn Thr Gln Ala
180 185 190

Leu Arg Trp Arg Thr Ser Val Gly Ala Ala Ala Ile Ser Val Ala Gln
195 200 205

Arg Val Ala Ser Ser Leu Arg Cys Gln Gly Leu Arg Ala Lys Leu Ala
210 215 220

Thr Ala Thr Asp Leu Ala Glu Leu Asp Arg Arg Leu Gly Ser Asp Ala 
225 230 235 240 

Val Ala Gly Ser Ala Gln Arg Trp Lys Ala Ile Arg Gly Glu Ala Gly
245 250 255

Trp Met Thr Thr Tyr Ala Tyr Pro Ala Glu Ala Ile Ser Ser Arg Val
260 265 270

Leu Ser Gln Ala Trp Thr Leu Arg Ala Asp Glu Val Ile Gln Asn Val
275 280 285

Thr Val Tyr Pro Asp Ala Thr Cys Thr Ala Thr Ile Thr Val Arg Thr
290 295 300

Pro Thr Pro Ala Pro Thr Pro Pro Ser Val Ile Leu Arg Arg Leu Asn
305 310 315 320

Gly Glu Gln Ala Ala Ala Ala Ala Ala Asn Met Cys Gly Pro Arg Pro
325 330 335

His Leu Arg Gly Gln Arg Arg Cys Pro Leu Pro Ala Gln Leu Val Thr
340 345 350

Glu Ile Gly Pro Ser Gly Val Leu Ile Gly Lys Leu Ser Asn Gly Asp
355 360 365

Arg Leu Met Ile Pro Val Thr Asp Ala Gly Glu Leu Ser Arg Val Phe
370 375 380

Val Ala Ala Asp Asp Thr Ile Ala Lys Arg Ile Val Ile Arg Val Val
385 390 395 400

Gly Ala Gly Glu Arg Val Cys Val His Thr Arg Asp Gln Glu Arg Trp
405 410 415

Ala Ser Val Arg Met Pro Gln Leu Ser Ile Val Gly Thr Pro Arg Pro
420 425 430

Ala Pro Arg Thr Thr Val Gly Val Val Glu Tyr Val Arg Arg Arg Lys 

435 440 445

Asn Gly Asp Asp Gly Lys Ser Glu Gly Ser Gly Val Asp Val Ala Ile
450 455 460

Ser Pro Thr Pro Arg Pro Ala Ser Val Ile Thr Ile Ala Arg Pro Gly
465 470 475 480

Thr Ser Leu Ser Glu Ser Asp Arg His Gly Phe Glu Val Thr Ile Glu
485 490 495

Gln Ile Asp Arg Ala Thr Val Lys Val Gly Ala Ala Gly Gln Asn Trp
500 505 510

Leu Val Glu Met Glu Met Phe Arg Ala Glu Asn Arg Tyr Val Ser Leu 
515 520 525 



Glu Pro Val Thr Met Ser Ile Gly Arg
530 535 



